Abstract

This article addresses regularity of optimal transport maps for cost = "squared distance"
on Riemannian manifolds that are products of arbitrarily many round spheres with arbitrary
sizes and dimensions. Such manifolds are known to be non-negatively cross-curved [KM2].
Under boundedness and non-vanishing assumptions on the transferred source and target
densities we show that optimal maps stay away from the cut-locus (where the cost exhibits
singularity), and obtain injectivity and continuity of optimal maps. This together with the
result of Liu, Trudinger and Wang [LTW] also implies higher regularity ($C^{1,\alpha}/C^\infty$)
of optimal maps for smoother ($C^\alpha/C^\infty$) densities. These are the first global regularity
results which we are aware of concerning optimal maps on non-flat Riemannian manifolds
which possess some vanishing sectional curvatures. Moreover, such product manifolds have
potential relevance in statistics (see [S]) and in statistical mechanics (where the state of a
system consisting of many spins is classically modeled by a point in the phase space obtained
by taking many products of spheres). For the proof we apply and extend the method developed
in [FKM1], where we showed injectivity and continuity of optimal maps on domains in
$\mathbb{R}^n$ for smooth non-negatively cross-curved cost. The major obstacle in the present paper is
to deal with the non-trivial cut-locus and the presence of flat directions.
1 Introduction

Let $M$ and $\bar M$ be $n$-dimensional complete Riemannian manifolds, and let
$\mu = \rho\,\mathrm{vol}_M$ and $\nu = \bar\rho\,\mathrm{vol}_{\bar M}$ be two probability measures whose
densities $\rho$ and $\bar\rho$ are bounded away from zero and infinity. Given a cost function
$c : M \times \bar M \to \mathbb{R}$, the optimal transport problem with cost $c(x, y)$ consists in finding a
transport map $T : M \to \bar M$ which sends $\mu$ onto $\nu$ and minimizes the transportation cost

$$\int_M c(x, T(x))\, d\mu(x).$$

As shown by McCann [M] extending the result of Brenier [Br] on $\mathbb{R}^n$, if $M = \bar M$ and
$c = \mathrm{dist}^2/2$ then the optimal transport map (or simply optimal map) exists and is unique.
More generally, the same result holds if the cost is semiconcave and satisfies the twist
condition in Assumption 2.1; see [L, MTW, FF, F].

The optimal map $T$ is uniquely characterized by the relation $T(x) \in \partial^c\phi(x)$, where $\phi$ is a
$c$-convex function (called potential) and $\partial^c\phi$ denotes its $c$-subdifferential (see Section 2
for the definitions). Furthermore, the fact that $\rho$ and $\bar\rho$ are bounded away from zero and
infinity ensures the existence of a constant $\lambda > 0$ such that the following Monge-Ampère
type equation holds:

$$\lambda|\Omega| \le |\partial^c\phi(\Omega)| \le \frac{1}{\lambda}|\Omega| \qquad \forall\, \Omega \subset M \text{ Borel},$$

where $\partial^c\phi(\Omega) = \bigcup_{x\in\Omega}\partial^c\phi(x)$. (See for instance [FKM1, Lemma 3.1].)
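
The constant $\lambda$ can be traced back to the density bounds. The following is only a heuristic
sketch under extra assumptions that are not part of the text above: the bounds
$a \le \rho, \bar\rho \le b$ with hypothetical constants $0 < a \le b$, and the identity
$\nu(\partial^c\phi(\Omega)) = \mu(\Omega)$, which uses that $T$ pushes $\mu$ onto $\nu$ and that the
$c$-subdifferentials are single-valued almost everywhere. The rigorous statement is the lemma
cited above.

```latex
\begin{align*}
|\partial^c\phi(\Omega)|
  &\ge \frac{1}{b}\,\nu\big(\partial^c\phi(\Omega)\big)
   = \frac{1}{b}\,\mu(\Omega)
   \ge \frac{a}{b}\,|\Omega|,\\
|\partial^c\phi(\Omega)|
  &\le \frac{1}{a}\,\nu\big(\partial^c\phi(\Omega)\big)
   = \frac{1}{a}\,\mu(\Omega)
   \le \frac{b}{a}\,|\Omega|.
\end{align*}
```

Under these assumptions one may take $\lambda = a/b$.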

The aim of this paper is to investigate the regularity issue of optimal maps when $M = \bar M$ are
multiple products of spheres, i.e., $M = \bar M = \mathbb{S}^{n_1}_{r_1} \times \dots \times \mathbb{S}^{n_k}_{r_k}$ and
$c(x,y) = f(\mathrm{dist}(x, y))$ for some function $f$, including the case $f(t) = t^2/2$ of distance
squared cost. For $k = 1$ and $f(t) = t^2/2$, smoothness of optimal maps has been proved by
Loeper [L2]. However, if $k > 1$ the structure of the cut-locus (the singular set of the cost
function) becomes more complicated, and due to the product structure, the manifold has both
flat and positively curved directions, thus making the regularity issue much more delicate.
In particular, the powerful Hölder regularity estimate of Loeper [L1] (see also [Li]) as well
as the a priori estimates of Ma, Trudinger and Wang [MTW], which are successfully applied to
positively curved manifolds as in [L2, KM2, LV, FR, DG, FRV], are no longer available in our
setting. Our main results (Theorem 5.1 and Corollary 5.3) give the first global regularity
results which we are aware of concerning optimal maps on non-flat Riemannian manifolds which
allow vanishing sectional curvature. For completely flat manifolds (with $c = \mathrm{dist}^2/2$) the
regularity of optimal maps is known as it reduces to the regularity theory
of the classical Monge- Ampere equation [DH [Call ICa2l ICa3l El [Call ICdllD2l lG]. 

To describe our result more precisely, first recall that in [MTW] Ma, Trudinger and Wang
discovered condition (A3) on the cost function, whose weaker variant (A3w) [TW] turned
out to be both necessary [L1] and sufficient [TW] for regularity when the solution $\phi$ is known
to be strictly $c$-convex and the cost function is smooth. When $M = \bar M = \mathbb{S}^n$, the particular
structure of the cut-locus (for every point $x$, its cut-locus $\mathrm{Cut}(x)$ consists of its antipodal
point) allowed Delanoë and Loeper [DL] to deduce that optimal maps stay away from the
cut-locus, namely $\partial^c\phi(x) \cap \mathrm{Cut}(x) = \emptyset$ for all $x \in M$; see [L2, DG, KM1, KM1a] for
alternate approaches. Loeper [L2] combined this observation with the fact that
$c = \mathrm{dist}^2/2$ satisfies (A3) to show regularity of optimal maps; for a simpler approach to
continuity, see [KM1, KM1a]. His result has been extended to a variety of positively curved
manifolds including the complex projective space [KM2] and perturbations of the real
projective space [LV] and of the sphere [FR, DG, FRV], on all of which (A3) holds, so that the
strong Hölder regularity estimate of [L1] as well as the a priori estimate of [MTW] apply.
Note that (A3) (resp. (A3w)) forces the sectional curvature to be positive (resp. nonnegative)
[L1], though the converse does not hold [K].

On multiple products of spheres, taking $c = \mathrm{dist}^2/2$ leads to two main issues. First, only a
degenerate strengthening of the weak Ma-Trudinger-Wang condition holds (the so-called
non-negative cross-curvature condition in [KM1, KM2]), which although stronger than (A3w) is
not as useful as (A3) for proving regularity due to the lack of powerful estimates (neither
non-negative cross-curvature nor (A3) implies the other, though either one separately implies
(A3w)). Second, the cut-locus now has a non-trivial structure, which makes it much more
difficult to understand whether the stay-away property holds. In [FKM1] we showed strict
$c$-convexity and $C^1$ regularity of $\phi$, or equivalently, injectivity and continuity of $T$, when
the cost is smooth and non-negative cross-curvature holds. Hence the only question left is
whether $\partial^c\phi$ avoids the cut-locus or not.

In this paper we answer this question positively: by taking advantage of the fact that the
cut-locus is given by the union of certain sub-products of spheres we prove in Theorem 5.1 the
stay-away property that $\partial^c\phi(x) \cap \mathrm{Cut}(x) = \emptyset$ for all $x \in M$. By compactness, these two
sets are separated by a uniform distance that depends on $\lambda$, but is independent of the
particular choice of $\phi$ and $x$; see Corollary 5.2. Once the stay-away property is shown, one can
localize the argument of [FKM1] to obtain injectivity and continuity of the optimal map; then
higher regularity ($C^{2,\alpha}/C^\infty$) of $\phi$, thus $C^{1,\alpha}/C^\infty$-regularity of $T$, follows from [LTW]
when the densities are smooth ($C^\alpha/C^\infty$); see Corollary 5.3.

Multiple products of spheres are a model case for more general manifolds on which the
cost $c$ satisfies the necessary conditions [L1, FRV] for regularity of optimal transport maps.
The method we develop in this paper demonstrates one approach to handling complex
singularities of the cost, especially the stay-away property of optimal maps. Moreover, a
general Alexandrov type estimate (Lemma 4.1) is obtained which has applications beyond the
products of spheres.

Our regularity result has potential relevance to statistics and statistical mechanics. For
instance, recently T. Sei applied optimal transport theory for $c = \mathrm{dist}^2$ to directional
statistics on the sphere. In his main result [S, Theorem 1], he needed the optimal map not to
touch the cut-locus. Now, our stay-away property on multiple products of spheres $M$
(Theorem 5.1) states that all optimal maps, obtained by transporting densities bounded away
from zero and infinity onto each other, satisfy this assumption. Hence, this provides a large
family of $c$-convex potentials that could be used to create log-concave likelihood functions as
in [S, Subsection 3.2], extending his theory to multiple products of spheres. Namely, as a
direct consequence of [KM2, FKM2, E], on multiple products of spheres a convex combination
$\phi = \sum_i s_i\phi_i$ ($s_i \ge 0$, $\sum_i s_i = 1$) of $c$-convex functions $\phi_i$ is again $c$-convex, thus a
crucial requirement in Sei's theory is satisfied. If each $\phi_i$ is the $c$-potential of an optimal
map between densities bounded away from zero and infinity, by Theorem 5.1 one sees that
$\partial^c\phi_i$ stays away from the cut-locus. One can then show that $\partial^c\phi$ also avoids the cut-locus,
thus applying [S, Theorem 1] one obtains the log-concave Jacobian inequality for this convex
combination. To see this, for example, observe that in the product of spheres the domain of
the exponential map is convex and $\partial^c\phi$ satisfies
$\partial^c\phi(x) = \exp_x\partial\phi(x) = \exp_x\big(\sum_i s_i\,\partial\phi_i(x)\big)$ for $x \in M$ (see Lemma 2.7). Since each
$\partial^c\phi_i$ stays away from the cut-locus, $\partial\phi_i(x)$ belongs to the domain of the exponential map,
and so does $\partial\phi(x)$, showing $\partial^c\phi(x) \cap \mathrm{Cut}(x) = \emptyset$.

Concerning statistical mechanics, let us recall that the state of a spin system is classically
modeled as a point in the phase space $M$ obtained by taking many products of spheres. In such
contexts, optimal transport may provide a useful change of variables. More precisely, if $\mu$
and $\nu$ are two smooth densities and $T$ denotes the optimal transport map from $\mu$ to $\nu$, then

$$\int G(y)\,d\nu(y) = \int G(T(x))\,d\mu(x),$$

for all bounded measurable functions $G : M \to \mathbb{R}$. Then, if $\mu$ is a "nice" measure for
which many statistical quantities are easily computable, one may hope to exploit some
qualitative/quantitative properties of $T$ in order to estimate the integral $\int G(y)\,d\nu(y)$ by
studying $\int G(T(x))\,d\mu(x)$. We expect that regularity of optimal maps may play a crucial role
in this direction. For instance, in Euclidean spaces this is already the case, as Caffarelli
[Ca5] used regularity of optimal maps to show that suitable monotonicity and log-concavity
properties of the densities imply monotonicity and contraction properties for the optimal map,
from which correlation and momentum inequalities may be deduced.
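
As a one-dimensional illustration of this change of variables (an assumed toy example on the
interval $[0,1]$, not taken from the setting of the paper), let $\mu$ be the uniform measure and
let $\nu$ have density $(1+2y)/2$, which is bounded between $1/2$ and $3/2$. The monotone
rearrangement $T$, which is the optimal map for the quadratic cost on the line, solves
$F_\nu(T(x)) = x$ with $F_\nu(y) = (y+y^2)/2$:

```latex
\begin{align*}
T(x) &= \frac{-1+\sqrt{1+8x}}{2}, \qquad T(0)=0,\quad T(1)=1,\\
\int_0^1 G(y)\,\frac{1+2y}{2}\,dy
  &= \int_0^1 G(T(x))\,\frac{1+2T(x)}{2}\,T'(x)\,dx
   = \int_0^1 G(T(x))\,dx.
\end{align*}
```

The last step uses $\frac{1+2T(x)}{2}\,T'(x) = \frac{d}{dx}F_\nu(T(x)) = 1$.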

Organization of the paper: Section 2 sets up the notation and assumptions used throughout
the paper. In Section 3 a few useful preliminary results regarding convex sets and $c$-convex
functions are listed. Section 4 is devoted to an Alexandrov type inequality which is one of the
main tools in the proof of our main theorem. Until Section 4 we present the theory under
rather general assumptions. However, from Section 5 we restrict to the multiple products of
spheres. In Section 5 we state our main result about the stay-away property of optimal maps,
and give a sketch of the proof. Moreover we explain how one can deduce regularity of optimal
maps combining this theorem with the results in [FKM1] and [LTW]. Finally, the details of the
proof of the stay-away property are given in Section 6.

Acknowledgement: The authors are pleased to thank Neil Trudinger, Tom Spencer, and 
Cedric Villani for useful discussions. 
2 Notation and assumptions

In this section and the next we recall notation and results which will be useful in the sequel.
Many of these results originated in or were inspired by the work of Ma, Trudinger, Wang
[MTW] and Loeper [L1]. Though the present paper mainly concerns the Riemannian distance
squared cost $c = \mathrm{dist}^2/2$ on the product of round spheres, we will present our work in a rather
general framework. It requires only a small additional effort and may prove useful for further
development and applications of the theory.

Let $M$, $\bar M$ be $n$-dimensional complete Riemannian manifolds, and let $c(x, \bar x)$ denote a cost
function $c : M \times \bar M \to \mathbb{R}$. We will assume throughout the paper that $c$ is semiconcave
in both variables, i.e., in coordinate charts it can be written as the sum of a concave and a
smooth function. Let us remark that since $\mathrm{dist}^2(x, y)$ is semiconcave on $M \times M$ (see for
example [FF, Appendix B]), the above assumption is satisfied for instance by any cost function
of the form $f(\mathrm{dist}(x, y))$ on $M \times M$, with $f : \mathbb{R} \to \mathbb{R}$ smooth, even, and strongly convex
(meaning $f(d) = f(-d)$ and $f''(d) > 0$ for all $d \ge 0$). Here and in the sequel we use smooth as
a synonym of $C^\infty$ (though $C^4$ would be enough for all our purposes).

As for $\bar x$ and $\bar M$, we use the "bar" notation to specify the second variable of the cost
function. Also as a notation we use $\bar c(\bar x, x) := c(x, \bar x)$. We denote by $D_x$ and $D_{\bar x}$ the
differentials with respect to the $x$ and $\bar x$ variables respectively. (For instance,
$D_xD_{\bar x}c(x_0, \bar x_0)$ denotes the mixed partial derivative of $c$ at $(x_0, \bar x_0)$.) Let $c\text{-Cut}(x)$
denote the $c$-cut-locus of $x \in M$, that is

$$c\text{-Cut}(x) := \{\bar x \in \bar M \mid c \text{ is not smooth in a neighborhood of } (x, \bar x)\},$$

and let $\bar M(x)$ denote the $c$-injectivity locus $\bar M \setminus c\text{-Cut}(x)$. Define $\bar c\text{-Cut}(\bar x)$,
$M(\bar x)$ similarly. These sets are open.

Assumption 2.1 (twist). For each $(x, \bar x) \in M \times \bar M$, the maps
$-D_xc(x, \cdot) : \bar M(x) \to T^*_xM$ and $-D_{\bar x}c(\cdot, \bar x) : M(\bar x) \to T^*_{\bar x}\bar M$ are smooth
embeddings (thus injective).

We remark that the above hypothesis from Levin [L] is equivalent to condition (A1) in
[MTW, L1, KM1], which together with the semiconcavity of the cost ensures existence and
uniqueness of optimal maps when the source measure is absolutely continuous with respect to
the volume measure (see for instance [L, FF, F] or [V, Chapter 10]).

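The domain of the $\bar c$-exponential $M^*(\bar x)$ in $T^*_{\bar x}\bar M$ is defined as the image of $M(\bar x)$
under the map $-D_{\bar x}c(\cdot, \bar x)$, i.e.,

$$M^*(\bar x) := -D_{\bar x}c(M(\bar x), \bar x) \subset T^*_{\bar x}\bar M.$$

Define $\bar M^*(x)$ similarly.

As in [MTW, L1], we define the $c$-exponential maps $c\text{-Exp}_x : \bar M^*(x) \subset T^*_xM \to \bar M$ and
$\bar c\text{-Exp}_{\bar x} : M^*(\bar x) \subset T^*_{\bar x}\bar M \to M$ as the inverse maps of $-D_xc(x, \cdot)$ and
$-D_{\bar x}c(\cdot, \bar x)$ respectively, i.e.,

$$p = -D_xc(x, c\text{-Exp}_x\,p) \ \text{ for } p \in \bar M^*(x), \qquad \bar p = -D_{\bar x}c(\bar c\text{-Exp}_{\bar x}\,\bar p, \bar x) \ \text{ for } \bar p \in M^*(\bar x).$$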

Given a set $X$, we denote by $\mathrm{cl}(X)$ its closure. Define the subdifferential of a semiconvex
function $a : M \to \mathbb{R}$ at $x \in M$ by

$$\partial a(x) := \{p \in T^*_xM \mid a(\exp_x v) - a(x) \ge \langle p, v\rangle + o(|v|_x) \text{ as } v \to 0 \text{ in } T_xM\}.$$

(This is non-empty at every point.) Here $\langle\cdot,\cdot\rangle$ denotes the pairing of covectors and vectors.
Assumption 2.2. For each $(x, \bar x) \in M \times \bar M$ the map $c\text{-Exp}_x$ (resp. $\bar c\text{-Exp}_{\bar x}$) extends to
a smooth map from $\mathrm{cl}(\bar M^*(x))$ (resp. $\mathrm{cl}(M^*(\bar x))$) onto $\bar M$ (resp. $M$). If we abuse notation
to use $c\text{-Exp}_x$, $\bar c\text{-Exp}_{\bar x}$ to denote these extensions, then they satisfy

$$c\text{-Exp}_x\,p = c\text{-Exp}_x\big(\partial_x[-c(x, c\text{-Exp}_x\,p)]\big), \qquad \forall p \in \mathrm{cl}(\bar M^*(x));$$
$$\bar c\text{-Exp}_{\bar x}\,\bar p = \bar c\text{-Exp}_{\bar x}\big(\partial_{\bar x}[-c(\bar c\text{-Exp}_{\bar x}\,\bar p, \bar x)]\big), \qquad \forall \bar p \in \mathrm{cl}(M^*(\bar x)).$$

Here, $\partial_x$, $\partial_{\bar x}$ denote the subdifferentials with respect to the variables $x$, $\bar x$, respectively.

Note that the above assumptions hold for instance when $M = \bar M$ and $c = \mathrm{dist}^2/2$ (so that
$c\text{-Exp}_x$ coincides with the Riemannian exponential map $\exp_x$). However, the following three
assumptions are much more restrictive, and not true for $c = \mathrm{dist}^2/2$ in general
[MTW, L1, KM1, LV]. They are all crucial in this paper.

Assumption 2.3 (convexity of domains of $c$-exponentials). For each $(x, \bar x) \in M \times \bar M$ the
domains $\bar M^*(x)$, $M^*(\bar x)$ are convex.

As shown in [FRV], the above assumption is necessary for continuity of optimal transport
maps when the cost function is given by the squared distance.

A $c$-segment $\{\bar x(t)\}_{0\le t\le 1}$ with respect to $x$ is the $c$-exponential image of a line segment in
$\mathrm{cl}(\bar M^*(x))$, i.e.,

$$\bar x(t) := c\text{-Exp}_x((1-t)p_0 + tp_1), \quad \text{for some } p_0, p_1 \in T^*_xM.$$

Define similarly a $\bar c$-segment $\{x(t)\}_{0\le t\le 1}$ with respect to $\bar x$. The notions of $c$- and
$\bar c$-segments, due to Ma, Trudinger and Wang, induce a natural extension of the notion of
convexity on sets in $M$, $\bar M$, called $c$-convexity in [MTW]. Let $\bar U \subset \bar M$, $x \in M$. The set $\bar U$ is
said to be $c$-convex with respect to $x$ if any two points in $\bar U$ are connected by a $c$-segment
with respect to $x$ entirely contained inside $\bar U$. Similarly we define $\bar c$-convex sets in $M$. It is
helpful to notice that $c$-, $\bar c$-convex sets (with respect to $x$, $\bar x$, respectively) are images of
convex sets under $c\text{-Exp}_x$, $\bar c\text{-Exp}_{\bar x}$, respectively.

Regarding $c$-, $\bar c$-segments, here comes a key assumption in this paper:

Assumption 2.4 (convex DASM). For every $(x, \bar x) \in M \times \bar M$, let $\{\bar x(t)\}_{0\le t\le 1}$,
$\{x(t)\}_{0\le t\le 1}$ be $c$-, $\bar c$-segments with respect to $x$, $\bar x$, respectively. Define the functions

$$m_t(\cdot) := -c(\cdot, \bar x(t)) + c(x, \bar x(t)), \qquad \bar m_t(\cdot) := -c(x(t), \cdot) + c(x(t), \bar x), \qquad 0 \le t \le 1.$$

Then

$$m_t \le (1-t)m_0 + t\,m_1, \qquad \bar m_t \le (1-t)\bar m_0 + t\,\bar m_1, \qquad 0 \le t \le 1. \tag{2.1}$$

When, instead of (2.1), only $m_t \le \max[m_0, m_1]$ and $\bar m_t \le \max[\bar m_0, \bar m_1]$ are required, this
property played a key role in the work of Loeper [L1]. In [KM1] we called it Loeper's maximum
principle (DASM), the acronym (DASM) standing for "Double Mountain Above Sliding
Mountain", a mnemonic which describes how the graphs of the functions $m_t$, $\bar m_t$ behave as $t$
is varied. For convenience we use this acronym in various places in the present paper. The
stronger property (convex DASM) was proved in [KM2] to be a consequence of the so-called
nonnegative cross-curvature condition on the cost $c$.
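
For orientation, it may help to see what (2.1) says for the Euclidean model cost
$c(x, \bar x) = |x - \bar x|^2/2$ on $\mathbb{R}^n$. This worked example is an illustration added under that
assumption (it is not the product-of-spheres setting of the paper): the $c$-exponential is a
translation, $c$-segments are straight segments, and $m_t$ is affine in $t$, so the convex
inequality holds with equality.

```latex
\begin{align*}
-D_x c(x,\bar x) &= \bar x - x
  \quad\Longrightarrow\quad c\text{-Exp}_x\,p = x+p,\\
\bar x(t) &= x+(1-t)p_0+tp_1 = (1-t)\,\bar x(0)+t\,\bar x(1),\\
m_t(y) &= -\tfrac12|y-\bar x(t)|^2+\tfrac12|x-\bar x(t)|^2
        = \tfrac12\big(|x|^2-|y|^2\big)+\langle y-x,\bar x(t)\rangle,\\
m_t(y) &= (1-t)\,m_0(y)+t\,m_1(y).
\end{align*}
```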

We will also need a strict version of Loeper's maximum principle (DASM):

Assumption 2.5 (DASM$^+$). With the same notation as in Assumption 2.4,

$$m_t(y) \le \max[m_0(y), m_1(y)] \quad \forall y \in M, \qquad \bar m_t(\bar y) \le \max[\bar m_0(\bar y), \bar m_1(\bar y)] \quad \forall \bar y \in \bar M.$$

Moreover, when the $c$- (resp. $\bar c$-) segment in the definition of $m_t$ (resp. $\bar m_t$) is nonconstant,
the equality holds if and only if $y = x$ (resp. $\bar y = \bar x$).

Assumptions 2.4 and 2.5 correspond to a "global" version of the non-negative cross-curvature
assumption and of the (A3) condition of the cost function $c$, respectively: see [KM1] and
[MTW] for the definition of nonnegative cross-curvature and (A3), respectively. Although the
equivalence between (convex DASM) and non-negative cross-curvature (resp. (DASM$^+$)
and (A3)) is not known in general, it holds true for the squared distance cost function on a
Riemannian manifold, as shown in [FV, FRV]. Moreover, Loeper's maximum principle (DASM)
is a necessary condition for regularity: this was originally shown in [L1] on domains in
$\mathbb{R}^n$ and later extended to the manifold case [FRV].

Given two functions $\phi : M \to \mathbb{R}$ and $\bar\phi : \bar M \to \mathbb{R}$, we say that they are $c$-convex and dual
with respect to each other if

$$\phi(x) = \sup_{\bar x\in\bar M}\{-c(x, \bar x) - \bar\phi(\bar x)\}, \tag{2.2}$$

$$\bar\phi(\bar x) = \sup_{x\in M}\{-c(x, \bar x) - \phi(x)\} = \sup_{x\in M}\{-\bar c(\bar x, x) - \phi(x)\}.$$

Since by assumption $c$ is semiconcave, both functions above are semiconvex (see for instance
[FF, Appendix A]). This implies in particular that their subdifferentials $\partial\phi(x)$, $\partial\bar\phi(\bar x)$ are
non-empty at every point.

We define the $c$-subdifferential $\partial^c\phi$ at a point $x$ as follows:

$$\partial^c\phi(x) := \{\bar x \in \bar M \mid \phi(y) - \phi(x) \ge -c(y, \bar x) + c(x, \bar x), \ \forall y \in M\}. \tag{2.3}$$

Analogously, we define $\partial^{\bar c}\bar\phi$ at every point $\bar x$. (Recall that $\bar c$ denotes the function defined
as $\bar c(\bar x, x) := c(x, \bar x)$.) The following well-known reciprocity holds:

Lemma 2.6 (Reciprocity). For $c$-convex functions $\phi$, $\bar\phi$ dual to each other as in (2.2),

$$\bar x \in \partial^c\phi(x) \iff \phi(x) + \bar\phi(\bar x) = -c(x, \bar x) \iff x \in \partial^{\bar c}\bar\phi(\bar x). \tag{2.4}$$

Proof. Suppose $\bar x \in \partial^c\phi(x)$. Then, by rearranging the inequality in (2.3) we get

$$-\phi(x) \ge c(x, \bar x) + \sup_{y\in M}\{-c(y, \bar x) - \phi(y)\},$$

and the supremum on the right hand side is exactly $\bar\phi(\bar x)$. On the other hand, from the
definition of $\phi$ and $\bar\phi$ we have

$$\bar\phi(\bar y) + c(x, \bar y) \ge -\phi(x) \qquad \forall \bar y \in \bar M,$$

so that combining these two inequalities leads to $\phi(x) + \bar\phi(\bar x) = -c(x, \bar x)$, and
$x \in \partial^{\bar c}\bar\phi(\bar x)$. The opposite implication follows by symmetry. □
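
In the Euclidean model case $c(x, \bar x) = |x - \bar x|^2/2$ on $\mathbb{R}^n$ (an illustrative assumption, not
the setting of the main theorem), the duality (2.2) and the reciprocity (2.4) reduce to the
Legendre transform and the equality case of the Fenchel-Young inequality. The auxiliary
functions $u$ and $\bar u$ below are introduced only for this sketch:

```latex
\begin{align*}
u(x) &:= \phi(x)+\tfrac12|x|^2, \qquad
\bar u(\bar x) := \bar\phi(\bar x)+\tfrac12|\bar x|^2,\\
u(x) &= \sup_{\bar x}\big\{\langle x,\bar x\rangle-\bar u(\bar x)\big\}, \qquad
\bar u(\bar x) = \sup_{x}\big\{\langle x,\bar x\rangle-u(x)\big\},\\
\phi(x)+\bar\phi(\bar x) = -c(x,\bar x)
  &\iff u(x)+\bar u(\bar x)=\langle x,\bar x\rangle
  \iff \bar x\in\partial u(x).
\end{align*}
```

So in this case $\partial^c\phi(x) = \partial u(x) = x + \partial\phi(x)$, consistent with the fact that
$-D_xc(x, \bar x) = \bar x - x$.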



Loeper [L1] deduced the following fundamental relation as a consequence of his maximum
principle (DASM).

Lemma 2.7 (Loeper's maximum principle (DASM)). Let Assumptions 2.1, 2.2 and 2.3
hold. Suppose Loeper's maximum principle (DASM) holds. Let $\phi$, $\bar\phi$ be $c$-convex functions dual
to each other as in (2.2). Then for all $x \in M$, $\bar x \in \bar M$,

$$c\text{-Exp}_x(\partial\phi(x)) = \partial^c\phi(x), \qquad \bar c\text{-Exp}_{\bar x}(\partial\bar\phi(\bar x)) = \partial^{\bar c}\bar\phi(\bar x).$$

Proof. The inclusions $c\text{-Exp}_x(\partial\phi(x)) \subset \partial^c\phi(x)$, $\bar c\text{-Exp}_{\bar x}(\partial\bar\phi(\bar x)) \subset \partial^{\bar c}\bar\phi(\bar x)$ follow
from the convexity of $\partial\phi(x)$ and the definition of Loeper's maximum principle (DASM). The
other inclusions hold in general without Loeper's maximum principle. Details can be found in
[L1]. □

In the following, we refer to the conclusion of this lemma also as Loeper's maximum principle
(DASM).

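For a set $\Omega \subset M$, the image $\partial^c\phi(\Omega)$ is defined as

$$\partial^c\phi(\Omega) := \bigcup_{x\in\Omega}\partial^c\phi(x).$$

For a $c$-convex function $\phi$ and an open set $U \subset M$ with $x_0 \in U$, we define the set
$[\partial^c\phi(U)]_{x_0} \subset \bar M$ as

$$[\partial^c\phi(U)]_{x_0} := \{\bar x \in \bar M \mid \phi(x) - \phi(x_0) \ge -c(x, \bar x) + c(x_0, \bar x) \text{ for all } x \in \partial U\}.$$

Trivially, $\partial^c\phi(x_0) \subset [\partial^c\phi(U)]_{x_0}$. This definition is justified by the following lemma,
which is also very useful in later discussions.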

Lemma 2.8. Let Assumptions 2.1, 2.2 and 2.3 hold. Suppose Loeper's maximum principle
(DASM) holds. Let $\phi$ be a $c$-convex function on $M$. Let $U \subset M$ be an open set, and let
$x_0 \in U$. Then

(1) $[\partial^c\phi(U)]_{x_0}$ is $c$-convex with respect to $x_0$;

(2) $[\partial^c\phi(U)]_{x_0} \subset \partial^c\phi(U)$;

(3) If $U \to \{x_0\}$, then both $\partial^c\phi(U)$, $[\partial^c\phi(U)]_{x_0} \to \partial^c\phi(x_0)$.

Proof. Assertion (1) follows directly from the definitions of Loeper's maximum principle (DASM)
and of the set $[\partial^c\phi(U)]_{x_0}$.

To prove Assertion (2), fix $\bar x \in [\partial^c\phi(U)]_{x_0}$, and move first the graph of the function
$-c(\cdot, \bar x)$ down so that it lies below $\phi$ inside $U$, and then lift it up until it touches the graph
of $\phi$ inside $\mathrm{cl}(U)$. Thanks to the assumption $\bar x \in [\partial^c\phi(U)]_{x_0}$ there exists at least one
touching point $x'$ which belongs to $U$ (indeed, if there is a touching point on $\partial U$, then $x_0$ is
another touching point), and Lemma 2.7 ensures that $\bar x \in \partial^c\phi(x')$.

For (3), the convergence $\partial^c\phi(U) \to \partial^c\phi(x_0)$ follows by continuity, and
$[\partial^c\phi(U)]_{x_0} \to \partial^c\phi(x_0)$ comes then from (2). □

For $\bar x \in \bar M$, let $S(\bar x)$ be the contact set

$$S(\bar x) := \{x \in M \mid \bar x \in \partial^c\phi(x)\} = \partial^{\bar c}\bar\phi(\bar x).$$

(The last identity follows from reciprocity, see Lemma 2.6.) For any $x_0 \in S(\bar x)$ one can write

$$S(\bar x) = \{x \in M \mid \phi(x) - \phi(x_0) = -c(x, \bar x) + c(x_0, \bar x)\}.$$
A set $Z$ in $M$ is called a $c$-section of $\phi$ with respect to $\bar x$ if there is $\lambda_{\bar x} \in \mathbb{R}$ such that

$$Z := \{z \in M \mid \phi(z) \le -c(z, \bar x) + \lambda_{\bar x}\}.$$

The following simple observation is very useful for studying regularity of $c$-convex functions.
It was originally made (implicitly) in [FKM1] and independently by Liu [Li].

Lemma 2.9 ($\bar c$-convex $c$-sections). Let Assumptions 2.1, 2.2 and 2.3 hold. Suppose Loeper's
maximum principle (DASM) holds. Let $\phi$ be a $c$-convex function on $M$, and fix $\bar x \in \bar M$. Every
$c$-section $Z$ of $\phi$ with respect to $\bar x$ is $\bar c$-convex with respect to $\bar x$.

Proof. This follows from the definition of $c$-convex functions and Loeper's maximum principle
(DASM). □

Given Borel sets $V \subset M$ and $\bar V \subset \bar M$, we denote by $|V|$ and $|\bar V|$ their volume (computed
with respect to the given Riemannian metric on $M$ and $\bar M$, respectively). The following is our
last assumption. As we already remarked in the introduction, it is satisfied whenever $\phi$ is the
potential associated to an optimal transport map and the densities are both bounded away from
zero and infinity.

Assumption 2.10 (bounds on $c$-Monge-Ampère measure of $\phi$). There exists $\lambda > 0$ such
that

$$\lambda|\Omega| \le |\partial^c\phi(\Omega)| \le \frac{1}{\lambda}|\Omega| \qquad \text{for all Borel sets } \Omega \subset M.$$

We sometimes abbreviate this condition on $\phi$ simply by writing $|\partial^c\phi| \in [\lambda, \frac{1}{\lambda}]$.

3 Preliminary results 

In this section, we list some preliminary results we require later. The first subsection deals with 
general convex sets and the second subsection considers the properties of the cost function under 
suitable assumptions. 

3.1 Convex sets 

We first list two properties of convex sets that will be useful later. 

Lemma 3.1 (John's lemma). For a compact convex set $S \subset \mathbb{R}^n$, there exists an affine
transformation $L : \mathbb{R}^n \to \mathbb{R}^n$ such that $B_1 \subset L^{-1}(S) \subset B_n$. Here, $B_1$ and $B_n$ denote the
balls of radius $1$ and $n$, respectively, centered at $0$.

Proof. See CO. □ 

Lemma 3.2. Let $S$ be a convex set in $\mathbb{R}^n = \mathbb{R}^{n'} \times \mathbb{R}^{n''}$, and denote by $\pi', \pi''$ the canonical
projections onto $\mathbb{R}^{n'}$ and $\mathbb{R}^{n''}$, respectively. Let $S'$ be a slice orthogonal to the second
component, that is

$$S' = (\pi'')^{-1}(x'') \cap S \qquad \text{for some } x'' \in \pi''(S).$$

Then there exists a constant $C(n)$, depending only on $n = n' + n''$, such that

$$C(n)\,|S| \ge \mathcal{H}^{n'}(S')\,\mathcal{H}^{n''}(\pi''(S)),$$

where $\mathcal{H}^d$ denotes the $d$-dimensional Hausdorff measure.

Proof. See [FKM1, Lemma 7.8]. □

The following lemma is important in the last step (Section 6.6) of the proof of the main
theorem.

Lemma 3.3. Let $X = X^1 \times \dots \times X^k$, with $X^i = \mathbb{R}^{n_i}$, $i = 1, \dots, k$, and write a point $x \in X$
as $x = (x^1, \dots, x^k)$, $x^i \in X^i$. For each $i = 1, \dots, k$, let $S^i$ be a subset of $X^i$, and let
$s_i = (s_i^1, \dots, s_i^k) \in X$ with $s_i^i \in S^i$. Define $S_i \subset X$ as

$$S_i := \{s_i^1\} \times \dots \times \{s_i^{i-1}\} \times S^i \times \{s_i^{i+1}\} \times \dots \times \{s_i^k\},$$

and consider the convex hull $\mathrm{co}(S_1, \dots, S_k)$ of the sets $S_1, \dots, S_k$. Then there exists a
constant $C(n, k)$, depending only on $n := n_1 + \dots + n_k$ and $k$, such that

$$C(n, k)\,|\mathrm{co}(S_1, \dots, S_k)| \ge \prod_{i=1}^{k}\mathcal{H}^{n_i}(S^i).$$

Proof. First consider the barycenter $b$ of the set $\{s_1, \dots, s_k\}$, that is

$$b := \frac{1}{k}(s_1 + \dots + s_k).$$

We will construct sets $S_i^b$ each of which contains $b$ and has Hausdorff measure comparable with
$S_i$. In addition, these sets are mutually orthogonal. We will finish the proof by considering the
volume of the convex hull of these sets $S_1^b, \dots, S_k^b$.

For each $i$, let $b_i$ be the barycenter of the set $\{s_1, \dots, s_k\} \setminus \{s_i\}$, i.e.,

$$b_i := \frac{1}{k-1}(s_1 + \dots + s_{i-1} + s_{i+1} + \dots + s_k).$$

Consider the cone $\mathrm{co}(b_i, S_i) \subset \mathrm{co}(S_1, \dots, S_k)$ and let $S_i^b$ be the intersection

$$S_i^b := \mathrm{co}(b_i, S_i) \cap \{x \in X \mid x^j = b^j \text{ for } j \ne i\}.$$

Note that $b \in S_i^b$ and these sets $S_1^b, \dots, S_k^b$ are mutually orthogonal, in the sense that, for
each $x \in S_i^b$ and $y \in S_j^b$ with $i \ne j$, it holds $(x - b)\cdot(y - b) = 0$. Now, consider the convex
hull $\mathrm{co}(S_1^b, \dots, S_k^b) \subset \mathrm{co}(S_1, \dots, S_k)$. The previous orthogonality implies

$$C(n)\,|\mathrm{co}(S_1^b, \dots, S_k^b)| \ge \prod_{i=1}^{k}\mathcal{H}^{n_i}(S_i^b)$$

for some constant $C(n)$ depending only on $n_1 + \dots + n_k$. (This inequality is obtained for
instance by iteratively applying Lemma 3.2.) To conclude the proof simply observe that
$b = \frac{1}{k}s_i + \frac{k-1}{k}b_i$, and so

$$\mathcal{H}^{n_i}(S_i^b) \ge \frac{1}{k^{n_i}}\mathcal{H}^{n_i}(S_i).$$

□
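
The final inequality in this proof is a scaling observation, spelled out here as a sketch (the
parameter $t$ and the notation $S_i^b$ for the slice through the barycenter $b$ are used only for
this remark). Points of the cone $\mathrm{co}(b_i, S_i)$ have the form $(1-t)b_i + ts$ with $s \in S_i$ and
$t \in [0,1]$, and every $s \in S_i$ has $j$-th component $s_i^j$ for $j \ne i$. Choosing $t = 1/k$ enforces
the constraint $x^j = b^j$, so $S_i^b$ contains a copy of $S_i$ shrunk by the factor $1/k$:

```latex
\begin{align*}
\Big(\tfrac{k-1}{k}\,b_i+\tfrac1k\,s\Big)^{j}
  &= \tfrac{k-1}{k}\,b_i^{j}+\tfrac1k\,s_i^{j} = b^{j}
  \qquad (s\in S_i,\ j\neq i),\\
S_i^b &\supset \tfrac{k-1}{k}\,b_i+\tfrac1k\,S_i,\\
\mathcal{H}^{n_i}(S_i^b)
  &\ge \mathcal{H}^{n_i}\big(\tfrac1k\,S_i\big)
   = k^{-n_i}\,\mathcal{H}^{n_i}(S_i).
\end{align*}
```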
3.2 Coordinate change

In this subsection we briefly recall the coordinate change introduced in [FKM1, Section 4.1]
that transforms $c$-convex functions into convex functions under the condition (convex DASM),
referring to [FKM1, Section 4.1] for more details. Throughout this subsection we let
Assumptions 2.1, 2.2 and 2.3 hold.

Let $\bar y_0$ be an arbitrary point in $\bar M$. Then the map $x \in M(\bar y_0) \mapsto q \in T^*_{\bar y_0}\bar M$ given by
$q(x) = -D_{\bar x}c(x, \bar y_0)$ is an embedding thanks to Assumption 2.1. Recall that
$M^*(\bar y_0) \subset T^*_{\bar y_0}\bar M$ denotes the image of this map, that this map is by definition the inverse
$\bar c$-exponential map $(\bar c\text{-Exp}_{\bar y_0})^{-1}$, and the $\bar c$-exponential map is a diffeomorphism up to the
boundary of $M^*(\bar y_0)$ (see Assumption 2.2). Denote

$$\tilde c(q, \bar x) := c(x(q), \bar x) - c(x(q), \bar y_0).$$

Then the $c$-convex function $\phi$ is transformed to a $\tilde c$-convex function $\varphi$ defined as

$$\varphi(q) := \phi(x(q)) + c(x(q), \bar y_0).$$

If Loeper's maximum principle (DASM) holds, then Lemma 2.9 shows that $\tilde c$-sections of $\varphi$ are
convex. This property was observed independently by Liu [Li], who used it to derive an optimal
Hölder exponent for optimal maps under the strict condition (A3) on the cost, sharpening the
Hölder continuity result of Loeper [L1]. Furthermore, if (convex DASM) holds then $-\tilde c(q, \bar x)$
is convex in $q$ for any $\bar x \in \bar M$, which then implies convexity of $\varphi$ in $q$ (see [FKM1, Theorem 4.3]
for more details). One can easily check that $\bar c$-segments with respect to $\bar x$ are transformed via
this coordinate change to $\bar{\tilde c}$-segments with respect to $\bar x$, and $c$-segments with respect to
$x(q)$ are transformed to $\tilde c$-segments with respect to $q$. Therefore, Loeper's maximum principle
(DASM) or (convex DASM) for $c$ implies the same for $\tilde c$.
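
To make the coordinate change concrete, consider the Euclidean model cost
$c(x, \bar x) = |x - \bar x|^2/2$ on $\mathbb{R}^n$ (an assumption made only for this illustration). Writing
$\tilde c$ and $\varphi$ for the transformed cost and potential, everything can be computed by hand:
$\tilde c$ is affine in $q$, and $\varphi$ differs from $q \mapsto \phi(q + \bar y_0) + |q + \bar y_0|^2/2$ by an affine
function.

```latex
\begin{align*}
q(x) &= -D_{\bar x}c(x,\bar y_0) = x-\bar y_0, \qquad x(q)=q+\bar y_0,\\
\tilde c(q,\bar x) &= \tfrac12|q+\bar y_0-\bar x|^2-\tfrac12|q|^2
   = \langle q,\bar y_0-\bar x\rangle+\tfrac12|\bar y_0-\bar x|^2,\\
\varphi(q) &= \phi(q+\bar y_0)+\tfrac12|q|^2 .
\end{align*}
```

In particular $-\tilde c(q, \bar x)$ is affine, hence convex, in $q$ for this cost.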

3.2.1 Relation between cotangent vectors in two different coordinates 

Here we give an explicit relation between covectors in the new coordinate variable $q$ (as
introduced above) and the original coordinate variable $x$. Fix arbitrary $\bar y_0 \in \bar M$,
$x_0 \in M(\bar y_0)$, and let $q_0 = -D_{\bar x}c(x_0, \bar y_0) \in T^*_{\bar y_0}\bar M$. For each $\bar z \in \bar M(x_0)$, consider the maps

$$\bar z \mapsto \eta(\bar z) := -D_xc(x_0, \bar z) \in T^*_{x_0}M,$$
$$\bar z \mapsto p(\bar z) := -D_q\tilde c(q_0, \bar z) \in T^*_{q_0}(T^*_{\bar y_0}\bar M), \tag{3.1}$$

where $\tilde c(q, \bar x) := c(x(q), \bar x) - c(x(q), \bar y_0)$ and the variables $x$ and $q$ are related as
$q(x) = -D_{\bar x}c(x, \bar y_0)$. Denote by $\bar M^*(x_0)$, $\bar M^*(q_0)$ the embedding of $\bar M(x_0)$ under the
mappings $\bar z \mapsto \eta(\bar z)$, $\bar z \mapsto p(\bar z)$, respectively. These sets are related by an affine map as we
see in the following lemma. In particular, from Assumption 2.3 both sets are convex in
$T^*_{x_0}M$, $T^*_{q_0}(T^*_{\bar y_0}\bar M)$, respectively.

Lemma 3.4. Let Assumptions 2.1 and 2.2 hold. Let $\eta(p)$ denote the map from $\bar M^*(q_0)$ to
$\bar M^*(x_0)$ that associates $p(\bar z)$ to $\eta(\bar z)$ as in the relation (3.1), and let
$\eta_0 = -D_xc(x_0, \bar y_0) \in \bar M^*(x_0) \subset T^*_{x_0}M$. Fix local coordinates. Then for all $p \in \bar M^*(q_0)$,
$\eta(p) = (\eta(p)_1, \dots, \eta(p)_n)$ is given as

$$\eta(p)_i = p^j\,\big(-D_{x^i}D_{\bar x^j}c(x_0, \bar y_0)\big) + (\eta_0)_i.$$

This formula allows the affine function $p \mapsto \eta(p)$ to be extended to a global map
$\eta : T^*_{q_0}(T^*_{\bar y_0}\bar M) \to T^*_{x_0}M$.

Proof. Observe that

$$-D_{x^i}c(x_0, \bar z) = D_{x^i}\big|_{x=x_0}\big[-c(x, \bar z) + c(x, \bar y_0) - c(x, \bar y_0)\big]$$
$$= -D_{q_j}\big|_{q=q_0}\big[c(x(q), \bar z) - c(x(q), \bar y_0)\big]\,\big(D_{x^i}\big|_{x=x_0}q_j\big) + (\eta_0)_i.$$

From the relation

$$q_j = -D_{\bar x^j}c(x(q), \bar y_0)$$

we see that

$$D_{x^i}\big|_{x=x_0}q_j = -D_{x^i}D_{\bar x^j}c(x_0, \bar y_0),$$

and the assertion follows. □

3.2.2 An estimate on the first derivatives of c 
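In Section 6.5 we will use the following simple estimate.

Lemma 3.5. Given convex sets $\Omega, \Lambda \subset \mathbb{R}^n$, assume that the function
$(q, y) \in \Omega \times \Lambda \mapsto \tilde c(q, y) \in \mathbb{R}$ is smooth. Then for all $q, \tilde q \in \Omega$ and $y \in \Lambda$ we have

$$|-D_q\tilde c(q, y) + D_q\tilde c(\tilde q, y)| \le C\,|q - \tilde q|\,|D_q\tilde c(q, y)|, \tag{3.2}$$

where the constant $C$ depends only on $\|\tilde c\|_{C^3(\Omega\times\Lambda)}$ and $\|(D^2_{qy}\tilde c)^{-1}\|_{L^\infty(\Omega\times\Lambda)}$.

Proof. See [FKM1, Lemma 7.7]. □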

4 An Alexandrov estimate: upper bound 

In this section we show a key Alexandrov type estimate (4.1) which bounds from above the size
of a $c$-section, say $Z_h$, by its height $h$. (An estimate that bounds the size of the $c$-section $Z_h$
either from above or below by its 'height' $h$ is called Alexandrov type.) This result is of
independent interest, especially because it is proven under rather general assumptions, and
does not rely on the special structure of products of spheres. In later sections, a companion
inequality showing the lower bound will be obtained for a special choice of a $c$-section in the
particular case of products of spheres, see Theorem 6.4.

The key point in the estimate below is that the term

$$\frac{\max_{x\in Z_h}|\det(-D_xD_{\bar x}c(x, \bar x_0))|}{\min_{x\in Z_h}|\det(-D_xD_{\bar x}c(x, \bar x_0))|}$$

appearing in (4.1) can be made as close to 1 as desired, provided one can ensure that the section
$Z_h$ converges to a point as $h \to 0$. This fact will play a crucial role in the proof of Theorem 5.1.

Lemma 4.1 (Alexandrov upper bound). Let $M$, $\bar M$ be complete $n$-dimensional Riemannian
manifolds. Suppose the cost $c : M \times \bar M \to \mathbb{R}$ satisfies Assumptions 2.1, 2.2, 2.3 and 2.4
(convex DASM). Let $\phi$ be a $c$-convex function on $M$ and assume $0 < \lambda \le |\partial^c\phi|$ for a fixed
$\lambda \in \mathbb{R}$. Fix $(x_0, \bar x_0) \in M \times \bar M$ such that $\bar x_0 \in \partial^c\phi(x_0)$, and for $h > 0$ consider the
$c$-section $Z_h$ defined as

$$Z_h := \{x \in M \mid \phi(x) - \phi(x_0) \le -c(x, \bar x_0) + c(x_0, \bar x_0) + h\}.$$

Assume that $-c(\cdot, \bar x_0)$ is smooth on $Z_h$, so that the function $\bar c\text{-Exp}_{\bar x_0}^{-1}$ is defined and
smooth on $Z_h$, or equivalently $Z_h \subset M(\bar x_0)$. Then the following inequality holds:



X\Z h \ 2 < C(n) 



max xeZh \det(-D x Dzc(x,x ))\~ 1 ' 



mm x£ z h \det(-D x D x c(x,x ))\ 
with the constant C(n) = (An) n \Bi\ 2 . 



sup sup j dc- Exp J _ , 

xeZ hp >£M*(x) p ~ p 



(4.1) 



Remark 4.2. In the statement of Lemma 4.1 and its proof, it is important to notice that, by the assumption that $-c(\cdot,\bar x_0)$ is smooth on $\operatorname{cl} Z_h$ and by Assumptions 2.1 and 2.2, the derivatives of $c\text{-Exp}_{\bar x_0}$ and its inverse (on $Z_h$), i.e., $-D_x D_{\bar x} c(x,\bar x_0)$, $x \in Z_h$, are all nonsingular.

Remark 4.3. For $c = \mathrm{dist}^2/2$, Loeper's maximum principle (DASM) (and so also (convex DASM)) implies that $M = \bar M$ has nonnegative sectional curvature (see [L1]). Therefore in this case $c\text{-Exp}_x$ is a contraction, that is

\[ \sup_{p'\in \bar M^*(x)} \bigl| d\,c\text{-Exp}_x|_{p=p'} \bigr| \le 1. \]

We do not know if this contraction property holds for general non-negatively cross-curved cost functions.

Proof. For globally smooth cost functions (on the products of two bounded domains) a similar result was proved in [FKM1, Proposition 7.3]. In the present case, where the cost function has singularities, the previous proof no longer works and we require the following subtle argument.

As in [FKM1], we will follow the strategy developed in [Ca1] by using renormalization techniques, but only after a suitable change of coordinates. Consider the coordinate change $x \in Z_h \mapsto q = -D_{\bar x} c(x(q),\bar x_0) \in W_h \subset M^*(\bar x_0) \subset T^*_{\bar x_0}\bar M$, i.e., $x = c\text{-Exp}_{\bar x_0} q$ and $Z_h = c\text{-Exp}_{\bar x_0}(W_h)$, and let

\[ m_{\bar x}(\cdot) := -c(\cdot,\bar x) + c(\cdot,\bar x_0). \]

As explained in Section 3.2, in these new coordinates the functions

\[ q \mapsto m_{\bar x}(x(q)) \quad\text{and}\quad q \mapsto \varphi(q) := \phi(x(q)) + c(x(q),\bar x_0) \]

are convex. Moreover the set $W_h$ is convex, as

\[ W_h = \{q \in T^*_{\bar x_0}\bar M \mid \varphi(q) - \varphi(q_0) \le h\}, \]

where $q_0$ is the point corresponding to $x_0$ in the new coordinates, i.e., $c\text{-Exp}_{\bar x_0} q_0 = x_0$. It is also important to notice that $\bar x_0 \in \partial^c\phi(x_0)$ implies $\varphi(q) - \varphi(q_0) \ge 0$. We now use Lemma 3.3 to find an affine map $A : T^*_{\bar x_0}\bar M \simeq \mathbb{R}^n \to T^*_{\bar x_0}\bar M \simeq \mathbb{R}^n$ such that $A(\tilde W_h) = W_h$, with $B_1 \subset \tilde W_h \subset B_n$. Denote $q_b = A(0)$ and $x_b = c\text{-Exp}_{\bar x_0} q_b$. Define the renormalized function $\tilde\varphi(\tilde q) := \varphi(A\tilde q)$ for each $\tilde q \in \tilde W_h$, and denote $\tfrac12 W_h := A(\tfrac12 \tilde W_h)$ and $\tfrac12 Z_h := c\text{-Exp}_{\bar x_0}(\tfrac12 W_h)$, where $\tfrac12 \tilde W_h$ denotes the dilation of $\tilde W_h$ by a factor $1/2$ with respect to the origin. This $1/2$-dilation (any factor in $(0,1)$ would work) will be important in this proof.
Consider the reciprocal expression

\[ \partial^c\phi(\tfrac12 Z_h) = c\text{-Exp}_{x_b}\bigl(-D_x c(x_b,\bar x_0) + V\bigr), \]

where

\[ V := \{ D_x m_{\bar x}(x_b) \mid \bar x \in \partial^c\phi(\tfrac12 Z_h) \} \subset T^*_{x_b} M. \]

Here $D_x m_{\bar x}$ denotes the differential when $m_{\bar x}$ is differentiable, otherwise it means an arbitrary covector in the subdifferential $\partial m_{\bar x}(x)$. Notice that $-D_x c(x_b,\bar x_0) + V \subset \operatorname{cl}(\bar M^*(x_b))$, and thus

\[ |\partial^c\phi(\tfrac12 Z_h)| \le \Bigl( \sup_{p'\in \bar M^*(x_b)} \bigl| d\,c\text{-Exp}_{x_b}|_{p=p'} \bigr| \Bigr)^{n} |V|. \tag{4.2} \]


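Now, the left-hand side is bounded from below as

\begin{align*}
|\partial^c\phi(\tfrac12 Z_h)| &\ge \lambda\,|\tfrac12 Z_h| && \text{(by the assumption $|\partial^c\phi| \ge \lambda$)} \\
&\ge \lambda \min_{w\in W_h} \bigl|\det\bigl(d\,c\text{-Exp}_{\bar x_0}|_{q=w}\bigr)\bigr| \Bigl(\frac12\Bigr)^{n} |W_h| && \text{(by $\tfrac12 Z_h = c\text{-Exp}_{\bar x_0}(\tfrac12 W_h)$)} \\
&\ge \lambda\, \frac{\min_{w\in W_h} |\det(d\,c\text{-Exp}_{\bar x_0}|_{q=w})|}{\max_{w\in W_h} |\det(d\,c\text{-Exp}_{\bar x_0}|_{q=w})|} \Bigl(\frac12\Bigr)^{n} |Z_h| \\
&\ge \lambda\, \frac{\min_{x\in Z_h} |\det(-D_x D_{\bar x} c(x,\bar x_0))|}{\max_{x\in Z_h} |\det(-D_x D_{\bar x} c(x,\bar x_0))|} \Bigl(\frac12\Bigr)^{n} |Z_h| && \text{(by $-D_{\bar x} c(\cdot,\bar x_0) = c\text{-Exp}_{\bar x_0}^{-1}$).} \tag{4.3}
\end{align*}

In the following we will bound $|V|$ from above by

\[ \frac{\max_{x\in Z_h} |\det(-D_x D_{\bar x} c(x,\bar x_0))|}{\min_{x\in Z_h} |\det(-D_x D_{\bar x} c(x,\bar x_0))|}\; |B_1|^2 (2n)^n\, \frac{h^n}{|Z_h|}, \]
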



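which will finish the proof; here the dilation $\tfrac12 Z_h$ plays a crucial role (see (4.5)). Fix $\bar x \in \partial^c\phi(\tfrac12 Z_h)$, and let $\tilde q_{\bar x} \in \tfrac12 \tilde W_h$ be such that $\bar x \in \partial^{\tilde c}\tilde\varphi(\tilde q_{\bar x})$. Here, the cost function $\tilde c$ is the cost function modified according to the coordinate changes:

\[ \tilde c(\tilde q,\bar y) := c\bigl(c\text{-Exp}_{\bar x_0}(A\tilde q),\bar y\bigr) - c\bigl(c\text{-Exp}_{\bar x_0}(A\tilde q),\bar x_0\bigr). \]

Consider the function

\[ \tilde m_{\bar x}(\tilde q) := m_{\bar x}\bigl(c\text{-Exp}_{\bar x_0}(A\tilde q)\bigr) = -\tilde c(\tilde q,\bar x). \tag{4.4} \]

Then

\[ \tilde m_{\bar x}(\tilde q) - \tilde m_{\bar x}(\tilde q_{\bar x}) + \tilde\varphi(\tilde q_{\bar x}) \le \tilde\varphi(\tilde q) \quad\text{for } \tilde q \in \tilde W_h. \]

We observe that $\tilde m_{\bar x}(\cdot) - \tilde m_{\bar x}(\tilde q_{\bar x})$ is a convex function on $\tilde W_h$ which vanishes at $\tilde q_{\bar x} \in \tfrac12 \tilde W_h$, and $\tilde m_{\bar x}(\cdot) - \tilde m_{\bar x}(\tilde q_{\bar x}) \le h$ on $\partial \tilde W_h$. Since $B_1 \subset \tilde W_h \subset B_n$ this easily gives $\tilde m_{\bar x}(0) - \tilde m_{\bar x}(\tilde q_{\bar x}) \ge -h$, which by convexity implies

\[ |D_{\tilde q}\tilde m_{\bar x}(0)| \le 2h. \tag{4.5} \]

To get information on $D_x m_{\bar x}$, observe that from (4.4)

\[ \bigl(d\,c\text{-Exp}_{\bar x_0}|_{q=q_b}\, A\bigr)^{*} D_x m_{\bar x}(x_b) = D_{\tilde q}\tilde m_{\bar x}(0), \tag{4.6} \]

where $(d\,c\text{-Exp}_{\bar x_0}|_{q=q_b} A)^* : T^*_{x_b} M \to T^*_0(T^*_{\bar x_0}\bar M)$ is the dual map of the derivative map $d\,c\text{-Exp}_{\bar x_0}|_{q=q_b} A : T_0(T^*_{\bar x_0}\bar M) \to T_{x_b} M$. Here we abuse the notation and $A$ denotes both the affine map and its derivative. Moreover we use the canonical identification $T^*_0(T^*_{\bar x_0}\bar M) \simeq T_0(T^*_{\bar x_0}\bar M) \simeq T^*_{\bar x_0}\bar M$. Hence (4.5) and (4.6) imply the key inclusion

\[ V \subset \bigl((d\,c\text{-Exp}_{\bar x_0}|_{q=q_b})^*\bigr)^{-1} (A^*)^{-1} B_{2h}, \]

so that

\begin{align*}
|V| &\le \bigl|\det\bigl((d\,c\text{-Exp}_{\bar x_0}|_{q=q_b})^*\bigr)\bigr|^{-1} |\det(A^*)|^{-1}\, |B_1|\, 2^n h^n \\
&= \bigl|\det\bigl(d\,c\text{-Exp}_{\bar x_0}|_{q=q_b}\bigr)\bigr|^{-1} |\det A|^{-1}\, |B_1|\, 2^n h^n && \text{(by the identification between vectors and covectors)} \\
&\le \bigl|\det\bigl(d\,c\text{-Exp}_{\bar x_0}|_{q=q_b}\bigr)\bigr|^{-1} |B_1|^2 (2n)^n \frac{h^n}{|W_h|} && \text{(by $|W_h| = |\det A|\,|\tilde W_h| \le |\det A|\,|B_n|$)} \\
&\le \frac{\max_{w\in W_h} |\det(d\,c\text{-Exp}_{\bar x_0}|_{q=w})|}{|\det(d\,c\text{-Exp}_{\bar x_0}|_{q=q_b})|}\, |B_1|^2 (2n)^n \frac{h^n}{|Z_h|} && \text{(by $|Z_h| \le \max_{w\in W_h} |\det(d\,c\text{-Exp}_{\bar x_0}|_{q=w})|\,|W_h|$)} \\
&\le \frac{\max_{x\in Z_h} |\det(-D_x D_{\bar x} c(x,\bar x_0))|}{\min_{x\in Z_h} |\det(-D_x D_{\bar x} c(x,\bar x_0))|}\, |B_1|^2 (2n)^n \frac{h^n}{|Z_h|}.
\end{align*}

Together with (4.2) and (4.3), this concludes the proof. □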
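
For the reader's convenience, the final bookkeeping of the proof can be written out as follows. This is only a sketch: the shorthand $R$ (determinant ratio) and $S$ (supremum of the differential of the c-exponential) is introduced here and is not notation from the text. It indicates how the lower bound (4.3), the upper bound (4.2) and the bound on $|V|$ combine, and why the constant is $(4n)^n|B_1|^2$; the determinant ratio enters once from each side.

```latex
% Shorthand introduced for this sketch only (not notation from the paper):
%   R = determinant ratio, S = supremum of |d c-Exp|.
\begin{align*}
R &:= \frac{\max_{x\in Z_h}|\det(-D_xD_{\bar x}c(x,\bar x_0))|}
           {\min_{x\in Z_h}|\det(-D_xD_{\bar x}c(x,\bar x_0))|},
\qquad
S := \sup_{x\in Z_h}\ \sup_{p'\in \bar M^*(x)}
     \bigl|d\,c\text{-Exp}_x|_{p=p'}\bigr|, \\[4pt]
\frac{\lambda}{R}\,2^{-n}\,|Z_h|
  &\le |\partial^c\phi(\tfrac12 Z_h)|   && \text{by (4.3)} \\
  &\le S^{n}\,|V|                        && \text{by (4.2)} \\
  &\le S^{n}\,R\,|B_1|^2(2n)^n\,\frac{h^n}{|Z_h|}
                                         && \text{by the bound on } |V|, \\[4pt]
\lambda\,|Z_h|^2
  &\le 2^n(2n)^n\,|B_1|^2\,R^2\,S^n\,h^n
   = (4n)^n\,|B_1|^2\,R^2\,S^n\,h^n .
\end{align*}
```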

5 Stay-away property on multiple products of spheres 

From now on we restrict our attention to the case $M = \bar M = M^1 \times \dots \times M^k$, where for each $i = 1,\dots,k$, $M^i = \mathbb{S}^{n_i}_{r_i}$ is a round sphere of constant sectional curvature $r_i^{-2}$. Though $M = \bar M$, we sometimes keep the bar notation to emphasize the distinction between the source and the target domain of the transportation. Let $x = (x^1,\dots,x^k)$ and $\bar x = (\bar x^1,\dots,\bar x^k)$ denote points in the product $M^1 \times \dots \times M^k$, with $x^i, \bar x^i \in M^i$, $i = 1,\dots,k$. Assume that the transportation cost $c$ on $M$ is the tensor product of the costs $c^i$ on each $M^i$, defined as

\[ c(x,\bar x) := \sum_{i=1}^{k} c^i(x^i,\bar x^i). \tag{5.1} \]

Assume moreover that each $c^i$ is of the form $f^i(\mathrm{dist}_i)$ ($\mathrm{dist}_i$ being the distance on $M^i$) for some smooth strongly convex even function $f^i : \mathbb{R} \to \mathbb{R}$, normalized so that $f^i(0) = 0$. (This normalization can be assumed with no loss of generality, as one can always add an arbitrary constant to the cost function.) Moreover we suppose that each $c^i$ satisfies Assumptions 2.1, 2.2, 2.3, 2.4 and 2.5 in Section 2. As shown in [KM2], under these assumptions the tensor product cost $c$ also satisfies Assumptions 2.1, 2.2, 2.3 and 2.4 (but not necessarily 2.5). The reader should have in mind that our model example is $c = \mathrm{dist}^2/2$, which as shown in [KM2] satisfies all the assumptions above. However we prefer to give a proof of the result with general $f^i$, since this costs no further effort in the proof, and we believe it may be of interest for future applications.
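
As a concrete illustration of the tensor product cost (5.1), the following minimal sketch evaluates $c(x,\bar x) = \sum_i f^i(\mathrm{dist}_i(x^i,\bar x^i))$ on a product of round spheres. The function names `sphere_dist`, `product_cost` and `half_square` are hypothetical helpers introduced here, and each sphere is assumed to be embedded in Euclidean space as the sphere of radius $r_i$ centred at the origin.

```python
import math


def sphere_dist(x, y, r=1.0):
    """Geodesic distance between x and y on the round sphere of radius r.

    Assumes x and y are coordinate tuples of Euclidean norm r.
    """
    dot = sum(a * b for a, b in zip(x, y)) / (r * r)
    dot = max(-1.0, min(1.0, dot))  # guard against rounding outside [-1, 1]
    return r * math.acos(dot)


def product_cost(xs, ys, radii, fs):
    """Tensor product cost: sum over the factors of f_i(dist_i(x_i, y_i))."""
    return sum(f(sphere_dist(x, y, r)) for x, y, r, f in zip(xs, ys, radii, fs))


def half_square(t):
    """Model profile f(t) = t^2 / 2, giving the cost dist^2 / 2."""
    return 0.5 * t * t


if __name__ == "__main__":
    # Unit 2-sphere times a circle of radius 2; antipodal in the first factor only.
    x = [(1.0, 0.0, 0.0), (0.0, 2.0)]
    y = [(-1.0, 0.0, 0.0), (0.0, 2.0)]
    print(product_cost(x, y, [1.0, 2.0], [half_square, half_square]))
```

In this example only the first factor contributes, with the maximal value $\pi^2/2$ attained at the antipodal pair, which is exactly where the cost fails to be smooth.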

Let us observe that for any point $\bar x$ we have $M(\bar x) = M^1(\bar x^1) \times \dots \times M^k(\bar x^k)$ and $M^*(\bar x) = M^{1*}(\bar x^1) \times \dots \times M^{k*}(\bar x^k)$. Moreover, since the distance squared function on a round sphere is smooth except for antipodal pairs, for each $x^i \in M^i$ we have $\mathrm{Cut}(x^i) = \{-x^i\}$, where $-x^i$ denotes the antipodal point of $x^i$. (We also write $-x = (-x^1,\dots,-x^k)$.) This implies easily that $c\text{-Cut}(\bar x) = \mathrm{Cut}(\bar x)$, so that $M^i(\bar x^i) = M^i \setminus \{-\bar x^i\}$ and $c\text{-Cut}(\bar x)$ is a union of (totally geodesic) submanifolds, each of which is an embedding of a product $M^{i_1} \times \dots \times M^{i_l}$, $l < k$.

The goal of the rest of the paper is to show a stay-away property of optimal transport maps 
on products of spheres: 

Theorem 5.1 (Stay-away from cut-locus). Let $M = \bar M = M^1 \times \dots \times M^k$, where for each $i = 1,\dots,k$, $M^i = \mathbb{S}^{n_i}_{r_i}$ is a round sphere of constant sectional curvature $r_i^{-2}$. Let $c$ be the cost given in (5.1) with $c^i$ of the form $f^i(\mathrm{dist}_i)$, where $f^i : \mathbb{R} \to \mathbb{R}$ are smooth strongly convex even functions such that $f^i(0) = 0$. Assume further that each cost $c^i$ satisfies Assumptions 2.1, 2.2, 2.3, 2.4 and 2.5, and let $\phi$ be a c-convex function satisfying Assumption 2.10. Then

\[ \partial^c\phi(x) \cap c\text{-Cut}(x) = \emptyset \qquad \forall x \in M. \]

Equivalently, for every $\bar x \in \bar M$ the contact set $S(\bar x) = \partial^{c^*}\phi^{c^*}(\bar x)$ satisfies

\[ S(\bar x) \cap c\text{-Cut}(\bar x) = \emptyset. \]

Before sketching the proof of this result, let us first see its consequences: 

Corollary 5.2 (Uniformly stay-away from cut-locus). Use the notation and assumptions as in Theorem 5.1. There exists a positive constant $C$ depending only on $\lambda$ (see Assumption 2.10) and $n_i$, $r_i$, $f^i$, for $i = 1,\dots,k$, such that

\[ \mathrm{dist}\bigl(\partial^c\phi(x),\, c\text{-Cut}(x)\bigr) \ge C \qquad \forall x \in M, \]

where $\mathrm{dist}$ denotes the Riemannian distance of $M$.

Proof. The result follows by compactness. Indeed, suppose by contradiction that there exist a sequence of c-convex functions $\phi_l$ satisfying Assumption 2.10 and points $x_l \in M$ such that

\[ \mathrm{dist}\bigl(\partial^c\phi_l(x_l),\, c\text{-Cut}(x_l)\bigr) \to 0 \quad\text{as } l \to \infty. \]

Up to adding a constant, we can also assume that $\phi_l(x_l) = 0$. Then, since $M$ is compact and the functions $\phi_l$ are uniformly semiconvex (and so uniformly Lipschitz), applying the Arzelà-Ascoli Theorem, up to a subsequence there exist a c-convex function $\phi_\infty$ and $x_\infty \in M$ such that $\phi_l \to \phi_\infty$ uniformly and $x_l \to x_\infty$. We now observe that $\phi_\infty$ also satisfies Assumption 2.10 (see for instance [FKM1, Lemma 3.1]). Moreover, by the definition of c-subdifferential we easily obtain that every limit point of a sequence $\bar x_l \in \partial^c\phi_l(x_l)$ belongs to $\partial^c\phi_\infty(x_\infty)$. This implies

\[ \mathrm{dist}\bigl(\partial^c\phi_\infty(x_\infty),\, c\text{-Cut}(x_\infty)\bigr) = 0, \]

which contradicts Theorem 5.1 and completes the proof. □

Corollary 5.3 (Regularity of optimal maps). Let $M$, $\bar M$, $c$ be as in Theorem 5.1. Assume that $\mu$ and $\nu$ are two probability measures absolutely continuous with respect to the volume measure, and whose densities are bounded away from zero and infinity. Then the unique optimal map $T$ from $\mu$ to $\nu$ is injective and continuous. Furthermore, if both densities are $C^\alpha$/$C^\infty$, then $T$ is $C^{1,\alpha}$/$C^\infty$.

Remark 5.4. The $C^{1,\alpha}$-regularity result ($C^{2,\alpha}$ for the potential $\phi$) in this corollary is a direct consequence of the injectivity and continuity of $T$ combined with the theory of Liu, Trudinger and Wang [LTW]. The higher regularity $C^\infty$ follows from Schauder estimates.

Proof. We recall that, under the assumption that $\mu$ and $\nu$ have densities bounded away from zero and infinity, there exists a c-convex function $\phi$ such that $T(x) = c\text{-Exp}_x(\nabla\phi(x))$ a.e., and $\phi$ satisfies Assumption 2.10 (see for instance [MTW] or [FKM1, Lemma 3.1]). Hence it suffices to prove that $\phi$ is $C^1$ and strictly c-convex, in the sense that $S(\bar x) = \partial^{c^*}\phi^{c^*}(\bar x)$ is a singleton for every $\bar x \in \partial^c\phi(M) = \bar M$.

To this aim, we observe that once we know that $\phi$ is strictly c-convex, then we can localize the proof of the $C^1$ regularity in [FKM1] to obtain the desired result. Thus we only need to show the strict c-convexity of $\phi$.

Fix $\bar x \in \bar M$. By Theorem 5.1 we know that $S(\bar x) \subset M(\bar x)$, so that in a neighborhood of $S(\bar x)$ we can consider the change of coordinates $x \mapsto q = -D_{\bar x} c(x,\bar x) \in T^*_{\bar x}\bar M$. As shown in [FKM1, Theorem 4.3], thanks to Loeper's maximum principle (DASM) the set $S(\bar x)$ is convex in these coordinates. Moreover, since now the cost is smooth in a neighborhood of $S(\bar x)$, by [FKM1, Theorem 8.1 and Remark 8.2] the compact convex set $S(\bar x)$ in the new coordinates has no exposed points on the support of $|\partial^c\phi|$. Since in our case the support of $|\partial^c\phi|$ is the whole $M$, the only possibility left is that $S(\bar x)$ is a singleton, as desired. □

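Sketch of the proof of Theorem 5.1. We prove this theorem by contradiction. Assume there exists a point $\bar x_0$ such that the contact set $S(\bar x_0)$ intersects $\mathrm{Cut}(\bar x_0)$. First, we find a cut-exposed point $x_0$ in $S(\bar x_0) \cap \mathrm{Cut}(\bar x_0)$. More precisely we split $M$ as $M^\bullet \times M^{\bullet\bullet}$ so that $x_0 = (x_0^\bullet, x_0^{\bullet\bullet})$, $\bar x_0 = (\bar x_0^\bullet, \bar x_0^{\bullet\bullet})$, where $x_0^\bullet = -\bar x_0^\bullet \in \mathrm{Cut}(\bar x_0^\bullet)$, $x_0^{\bullet\bullet}$ stays away from the cut-locus of $\bar x_0^{\bullet\bullet}$, and $x_0^{\bullet\bullet}$ is an exposed point in the set $S^{\bullet\bullet} = \{y^{\bullet\bullet} \in M^{\bullet\bullet} \mid (-\bar x_0^\bullet, y^{\bullet\bullet}) \in S(\bar x_0)\}$ (see Section 6.1). Near $\bar x_0$, for $\varepsilon \in (0,1)$ and $\delta \in [0,1]$ we construct a family of points $\bar x_{\varepsilon,\delta} = (\bar x^\bullet_\varepsilon, \bar x^{\bullet\bullet}_\delta)$ such that $d(\bar x_0, \bar x_{\varepsilon,\delta}) \approx \varepsilon + \delta$, so that for $\delta$ small we have $\bar x_{\varepsilon,\delta} \in \bar M(x_0)$, or equivalently $x_0 \in M(\bar x_{\varepsilon,\delta})$. By suitably choosing the point $\bar x^{\bullet\bullet}_\delta$ in order to exploit the fact that $x_0^{\bullet\bullet}$ is an exposed point for $S^{\bullet\bullet}$, we can ensure that, if $Z_{\varepsilon,\delta,h}$ denotes a section obtained by cutting the graph of $\phi$ with $-c(\cdot,\bar x_{\varepsilon,\delta})$ at height $h$ above $x_0$, then for any fixed $\varepsilon \in (0,1)$ we have $Z_{\varepsilon,\delta,h} \to \{x_0\}$ as $\delta, h/\delta \to 0$ (see Section 6.2). In particular, for $\varepsilon > 0$ fixed we have $Z_{\varepsilon,\delta,h} \subset M(\bar x_{\varepsilon,\delta})$ for $\delta$, $h/\delta$ small (equivalently, the function $c(\cdot,\bar x_{\varepsilon,\delta})$ is smooth inside $Z_{\varepsilon,\delta,h}$). Now we take advantage of the choice of $\bar x^\bullet_\varepsilon$: on the sphere $\mathbb{S}^n$ the function $-\mathrm{dist}^2(\cdot,\bar x)$ looks like a cone near the antipodal point $-\bar x$, and if $\mathrm{dist}(\bar x,\bar x_\varepsilon) \approx \varepsilon$ then the section obtained by cutting the graph of $-\mathrm{dist}^2(\cdot,\bar x)$ with $-\mathrm{dist}^2(\cdot,\bar x_\varepsilon)$ at height $h$ above $-\bar x$ has measure $\approx h^n/\varepsilon$ (see Proposition 6.7). In our case, since $x_0^\bullet = -\bar x_0^\bullet$, the function $\phi$ behaves as $-c(\cdot,\bar x_0) \approx -\mathrm{dist}(\cdot,\bar x_0)$ along $M^\bullet$ (see Lemma 6.6). Hence by the argument above we have an improvement of a factor $1/\varepsilon$ in the measure of $Z_{\varepsilon,\delta,h}$ (see Proposition 6.7), which allows us to show the following Alexandrov type inequality:

\[ h^{\dim M} \lesssim \varepsilon\, |Z_{\varepsilon,\delta,h}|\, |\partial^c\phi(Z_{\varepsilon,\delta,h})| \qquad\text{for $\delta$ and $h/\delta$ sufficiently small,} \]

where $\lesssim$ is independent of $\varepsilon$, $\delta$ and $h$ (see Theorem 6.4). Thanks to Assumption 2.10 the above inequality implies

\[ h^{\dim M} \lesssim \frac{\varepsilon}{\lambda}\, |Z_{\varepsilon,\delta,h}|^2 \qquad\text{for $\delta$ and $h/\delta$ sufficiently small.} \tag{5.2} \]
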
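On the other hand, since $Z_{\varepsilon,\delta,h} \subset M(\bar x_{\varepsilon,\delta})$ for $\delta$ and $h/\delta$ small enough, we can apply Lemma 4.1 to $Z_{\varepsilon,\delta,h}$ and have

\[ \lambda |Z_{\varepsilon,\delta,h}|^2 \le C(n) \left( \frac{\max_{x\in Z_{\varepsilon,\delta,h}} |\det(-D_x D_{\bar x} c(x,\bar x_{\varepsilon,\delta}))|}{\min_{x\in Z_{\varepsilon,\delta,h}} |\det(-D_x D_{\bar x} c(x,\bar x_{\varepsilon,\delta}))|} \right)^{2} \Bigl( \sup_{x\in Z_{\varepsilon,\delta,h}}\ \sup_{p'\in \bar M^*(x)} \bigl| d\,c\text{-Exp}_x|_{p=p'} \bigr| \Bigr)^{\dim M} h^{\dim M}. \]

The convergence $Z_{\varepsilon,\delta,h} \to \{x_0\}$ as $\delta, h/\delta \to 0$ further reduces this inequality to

\[ \lambda |Z_{\varepsilon,\delta,h}|^2 \lesssim h^{\dim M} \qquad\text{for $\delta$ and $h/\delta$ sufficiently small,} \]

which contradicts (5.2) as $\varepsilon \to 0$ and completes the proof. □

The rest of the paper is devoted to fleshing out the details of the above proof.

Figure 1: On the sphere, the squared distance function from a point $\bar x$ looks like a cone near $-\bar x$. So, if $\mathrm{dist}(\bar x,\bar x_\varepsilon) = \mathrm{dist}(-\bar x,-\bar x_\varepsilon) \approx \varepsilon$, the section obtained by cutting its graph with $-\mathrm{dist}^2(\cdot,\bar x_\varepsilon)$ at height $h$ has measure $\approx h^n/\varepsilon$.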
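
The heuristic of Figure 1 can be checked by hand in a flat one-dimensional model. The sketch below is an illustration under simplifying assumptions, not the construction used in the paper: the cone is replaced by $t \mapsto |t|$ on the real line and the cutting function by the linear function of slope $1-\varepsilon$, so the section is $\{t : |t| - (1-\varepsilon)t \le h\} = [-h/(2-\varepsilon),\, h/\varepsilon]$. The function names are hypothetical.

```python
def cone_section_length(eps, h):
    """Exact length of {t : |t| - (1 - eps) * t <= h} for 0 < eps < 1, h > 0.

    For t >= 0 the condition reads eps * t <= h; for t < 0 it reads
    (2 - eps) * |t| <= h, so the section is [-h / (2 - eps), h / eps].
    """
    return h / eps + h / (2.0 - eps)


def cone_section_length_numeric(eps, h, samples=200_000):
    """Midpoint-rule estimate of the same length, as an independent check."""
    lo, hi = -2.0 * h, 2.0 * h / eps  # interval containing the whole section
    dt = (hi - lo) / samples
    count = 0
    for k in range(samples):
        t = lo + (k + 0.5) * dt
        if abs(t) - (1.0 - eps) * t <= h:
            count += 1
    return count * dt


if __name__ == "__main__":
    for eps in (0.5, 0.1, 0.01):
        h = 1e-3
        # The ratio to h / eps stays between 1 and 2 as eps decreases.
        print(eps, cone_section_length(eps, h) / (h / eps))
```

In this model the length is at least $h/\varepsilon$, which is the one-dimensional analogue of the gain of a factor $1/\varepsilon$ used in (5.2).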



6 Proof of Theorem 5.1 (Stay-away from cut-locus)

6.1 Cut-exposed points of contact sets

Assume by contradiction that there exists $\bar x_0 = (\bar x_0^1,\dots,\bar x_0^k) \in \bar M = M = M^1 \times \dots \times M^k$ such that $S(\bar x_0) \cap c\text{-Cut}(\bar x_0) \ne \emptyset$. To prove Theorem 5.1, a first step is to find a cut-exposed point of the contact set in the intersection with the cut-locus, which we define throughout the present section.

Let $y \in S(\bar x_0) \cap c\text{-Cut}(\bar x_0)$, and note that one of the components of $y = (y^1,\dots,y^k)$, say $y^j$, satisfies $y^j = -\bar x_0^j$. Moreover we cannot have $y = -\bar x_0$. Indeed it is not difficult to see that, if $\bar x_0 \in \partial^c\phi(-\bar x_0)$, then $\partial^c\phi(-\bar x_0) = \bar M$ (see for instance Lemma 6.6(1) below), which contradicts Assumption 2.10.

Among all points $y \in S(\bar x_0) \cap c\text{-Cut}(\bar x_0)$, choose one such that the number $a_0$ of its antipodal (or cut-locus) components is maximal, and denote the point by $y_0$. By rearranging the product $M^1 \times \dots \times M^k$, we may write without loss of generality that

\[ y_0 = (-\bar x_0^1,\dots,-\bar x_0^{a_0},\, y_0^{a_0+1},\dots,y_0^k), \qquad y_0^i \notin c\text{-Cut}(\bar x_0^i) \quad \forall\, i = a_0+1,\dots,k. \tag{6.1} \]



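For convenience, use the expression

\[ M^\bullet = \bar M^\bullet = M^1 \times \dots \times M^{a_0}, \qquad M^{\bullet\bullet} = \bar M^{\bullet\bullet} = M^{a_0+1} \times \dots \times M^k. \]

The expressions $A^\bullet$, $A^{\bullet\bullet}$ will be used to denote things defined for elements in $M^\bullet$, $M^{\bullet\bullet}$, respectively. For example,

\[ y^\bullet = (y^1,\dots,y^{a_0}), \qquad y^{\bullet\bullet} = (y^{a_0+1},\dots,y^k), \]

\[ c^\bullet(y^\bullet,\bar y^\bullet) = \sum_{i=1}^{a_0} c^i(y^i,\bar y^i), \qquad c^{\bullet\bullet}(y^{\bullet\bullet},\bar y^{\bullet\bullet}) = \sum_{i=a_0+1}^{k} c^i(y^i,\bar y^i). \]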



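Consider the set

\[ S^{\bullet\bullet} = \{ y^{\bullet\bullet} \in M^{\bullet\bullet} \mid (-\bar x_0^\bullet, y^{\bullet\bullet}) \in S(\bar x_0) \}. \]

Notice that due to maximality of $a_0$, $S^{\bullet\bullet} \subset M^{\bullet\bullet}(\bar x_0^{\bullet\bullet})$ and it is embedded into $M^{\bullet\bullet *}(\bar x_0^{\bullet\bullet})$ through the map $y^{\bullet\bullet} \mapsto q^{\bullet\bullet}(y^{\bullet\bullet}) = -D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(y^{\bullet\bullet},\bar x_0^{\bullet\bullet})$. Observe that since $S^{\bullet\bullet}$ is compact, the resulting set, say $\tilde S^{\bullet\bullet}$, is compact too. Moreover $\tilde S^{\bullet\bullet}$ is convex since it is the restriction of the convex set $\tilde S(\bar x_0)$ to $M^{\bullet\bullet *}(\bar x_0^{\bullet\bullet})$, where $\tilde S(\bar x_0)$ is the image of $S(\bar x_0)$ under the map $x \mapsto -D_{\bar x} c(x,\bar x_0)$. (More precisely, this set $\tilde S(\bar x_0)$ is defined as the closure of the image of $S(\bar x_0) \setminus c\text{-Cut}(\bar x_0)$.) This compact convexity ensures the existence of an exposed point $q_0^{\bullet\bullet}$ for $\tilde S^{\bullet\bullet}$, that is, there exists an affine function $L$ on $T^*_{\bar x_0^{\bullet\bullet}}\bar M^{\bullet\bullet}$ such that

\[ L(q_0^{\bullet\bullet}) = 0, \quad\text{and}\quad L(q^{\bullet\bullet}) < 0 \quad \forall\, q^{\bullet\bullet} \in \tilde S^{\bullet\bullet} \setminus \{q_0^{\bullet\bullet}\}. \tag{6.2} \]

(In case $\tilde S^{\bullet\bullet} = \{q_0^{\bullet\bullet}\}$ let $L \equiv 0$.) One should note that if $L$ is such an affine function, then $tL$ is also such an affine function for any $t > 0$. Let $x_0 \in S(\bar x_0)$ be the point of $M$ corresponding to $q_0^{\bullet\bullet}$, that is,

\[ x_0 = (-\bar x_0^\bullet,\, x_0^{\bullet\bullet}), \qquad\text{where}\qquad q_0^{\bullet\bullet} = -D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}). \tag{6.3} \]

We call this point $x_0$ a cut-exposed point of $S(\bar x_0)$, since its components are either cut-locus type or exposed.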



Figure 2: Starting from a point $y_0 \in S(\bar x_0)$ such that the number of its antipodal (or cut-locus) components is maximal, we choose $x_0 = (y_0^\bullet, x_0^{\bullet\bullet}) = (-\bar x_0^\bullet, x_0^{\bullet\bullet}) \in M^\bullet \times M^{\bullet\bullet} = M$ so that $x_0^{\bullet\bullet}$ is an exposed point for $S^{\bullet\bullet}$ (in some suitable system of coordinates).

One can assume with a further rearrangement of the product $M^{\bullet\bullet} = M^{a_0+1} \times \dots \times M^k$ that there exists $b_0 \in \{a_0,\dots,k\}$ with the following two properties:

1. For each $i \in \{a_0+1,\dots,b_0\}$, there exists $\bar y_i \in \partial^c\phi(x_0)$ with

\[ \bar y_i^i = -x_0^i. \tag{6.4} \]

2. For every $\bar y \in \partial^c\phi(x_0)$,

\[ \bar y^j \ne -x_0^j \ \ (\text{or equivalently } \bar y^j \notin c\text{-Cut}(x_0^j) = \mathrm{Cut}(x_0^j)) \quad\text{for } j = b_0+1,\dots,k. \tag{6.5} \]

(If $b_0 = a_0$, $\{a_0+1,\dots,b_0\} = \emptyset$.)

After this rearrangement, define

\[ M' = \bar M' = M^1 \times \dots \times M^{b_0}, \qquad M'' = \bar M'' = M^{b_0+1} \times \dots \times M^k. \]

The expressions $A'$, $A''$ will be used to denote things defined for elements in $M'$, $M''$, respectively. For example,

\[ y = (y',y''), \quad \bar y = (\bar y',\bar y'') \in M = M' \times M'' = \bar M' \times \bar M'', \]

\[ c(y,\bar y) = c'(y',\bar y') + c''(y'',\bar y''), \]

and we have the identification

\[ T^*_{\bar x_0}\bar M = T^*_{\bar x_0'}\bar M' \times T^*_{\bar x_0''}\bar M'', \qquad M^*(\bar x_0) = M'^*(\bar x_0') \times M''^*(\bar x_0''). \]

In the following $n' = \dim M'$, $n'' = \dim M''$, and $\pi'$, $\pi''$ denote the canonical projections from $M$ to $M'$, $M''$, respectively.

6.2 Analysis near the cut-exposed point. 

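In this subsection we construct a family of c-sections $Z_{\varepsilon,\delta,h}$ of $\phi$ near the cut-exposed point $x_0$ defined in (6.3). Regarding these c-sections, two important results (Propositions 6.2 and 6.3) are obtained. In later subsections we will show an Alexandrov type inequality for $Z_{\varepsilon,\delta,h}$ which will be paired with the other Alexandrov type inequality (4.1) to lead to a contradiction to the existence of such $x_0$, thus finishing the proof of Theorem 5.1.

Recall the affine function $L$ on $T^*_{\bar x_0^{\bullet\bullet}}\bar M^{\bullet\bullet}$ given in (6.2). After modifying $L$ by multiplying it by an appropriate positive constant, there exists a geodesic curve $[0,1] \ni \delta \mapsto \bar x^{\bullet\bullet}_\delta \in \bar M^{\bullet\bullet}(x_0^{\bullet\bullet})$ starting from $\bar x_0^{\bullet\bullet}$ such that for the linear map $\nabla L$ on $T^*_{\bar x_0^{\bullet\bullet}}\bar M^{\bullet\bullet}$,

\[ \nabla L(q^{\bullet\bullet} - q_0^{\bullet\bullet}) = \Bigl\langle \frac{d}{dt}\Big|_{t=0} \bar x^{\bullet\bullet}_t,\ q^{\bullet\bullet} - q_0^{\bullet\bullet} \Bigr\rangle. \tag{6.6} \]

Consider a $c^\bullet$-segment $[0,1] \ni \varepsilon \mapsto \bar x^\bullet_\varepsilon \in \bar M^\bullet$ with respect to $x_0^\bullet$ connecting the point $\bar x_0^\bullet$ to its antipodal point $\bar x^\bullet_{1/2} = -\bar x_0^\bullet$, then to $\bar x^\bullet_1 = \bar x_0^\bullet$. ($\bar x^\bullet_\varepsilon$ is nothing else than a closed geodesic starting from $\bar x_0^\bullet$ and passing through $x_0^\bullet = -\bar x_0^\bullet$ at $\varepsilon = 1/2$.) Define

\[ \bar x_{\varepsilon,\delta} := (\bar x^\bullet_\varepsilon,\ \bar x^{\bullet\bullet}_\delta) \in \bar M = \bar M^\bullet \times \bar M^{\bullet\bullet}. \tag{6.7} \]

Obviously $\bar x_{0,0} = \bar x_0$. Two important properties follow:

(a) Since $\bar x^\bullet_\varepsilon \in \bar M^\bullet(x_0^\bullet)$ for $\varepsilon \in (0,1)$ and $x_0^{\bullet\bullet} \in M^{\bullet\bullet}(\bar x_0^{\bullet\bullet})$ we have

\[ x_0 \in M(\bar x_{\varepsilon,\delta}) = M^\bullet(\bar x^\bullet_\varepsilon) \times M^{\bullet\bullet}(\bar x^{\bullet\bullet}_\delta) \qquad \forall\, 0 < \varepsilon < 1,\ \delta \ge 0 \text{ small}. \]

(b) Since $\bar x^\bullet_{\varepsilon,\delta} = \bar x^\bullet_\varepsilon \ne \bar x_0^\bullet$ for $\varepsilon \in (0,1)$, $\bar x^\bullet_1 = \bar x_0^\bullet$, and $x_0^\bullet = -\bar x_0^\bullet$, for every $\varepsilon \in (0,1)$ and $\delta \in [0,1]$ we have from Assumption 2.5 for each $c^i$,

\[ -c^\bullet(x^\bullet,\bar x^\bullet_{\varepsilon,\delta}) + c^\bullet(x_0^\bullet,\bar x^\bullet_{\varepsilon,\delta}) \le -c^\bullet(x^\bullet,\bar x_0^\bullet) + c^\bullet(x_0^\bullet,\bar x_0^\bullet) \qquad \forall x^\bullet \in M^\bullet, \tag{6.8} \]

with equality only when $x^\bullet = x_0^\bullet$. (See for instance Lemma 6.5 below.)

Consider now the c-section $Z_{\varepsilon,\delta,h}$ obtained by cutting the graph of $\phi$ by the graph of $-c(\cdot,\bar x_{\varepsilon,\delta}) + c(x_0,\bar x_{\varepsilon,\delta}) + h$, that is

\[ Z_{\varepsilon,\delta,h} := \{ x \in M \mid \phi(x) - \phi(x_0) + c(x,\bar x_{\varepsilon,\delta}) - c(x_0,\bar x_{\varepsilon,\delta}) \le h \}. \tag{6.9} \]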

As it can be easily seen by moving down the graph of — c(-,x Et s) and lift it up until it touches 
the graph of <fi, x £t s G d<p{Z £ ^ ) h)- Hence, thanks to Loeper's maximum principle (DASM) we 
have 

x £tS G d c <P(Z £Ah ). (6.10) 
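Proposition 6.1. The following equality holds:

\[ Z_{\varepsilon,0,0} = S(\bar x_{\varepsilon,0}) = S(\bar x_0) \cap (\{x_0^\bullet\} \times M^{\bullet\bullet}). \tag{6.11} \]

Proof. From (6.8),

\begin{align*}
& \phi(x) - \phi(x_0) + c(x,\bar x_{\varepsilon,0}) - c(x_0,\bar x_{\varepsilon,0}) \\
&\quad = \phi(x) - \phi(x_0) + c^\bullet(x^\bullet,\bar x^\bullet_{\varepsilon,0}) - c^\bullet(x_0^\bullet,\bar x^\bullet_{\varepsilon,0}) + c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet}) - c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}) \\
&\quad \ge \phi(x) - \phi(x_0) + c^\bullet(x^\bullet,\bar x_0^\bullet) - c^\bullet(x_0^\bullet,\bar x_0^\bullet) + c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet}) - c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}) \\
&\quad = \phi(x) - \phi(x_0) + c(x,\bar x_0) - c(x_0,\bar x_0) \ \ge\ 0.
\end{align*}

This, together with the equality case for (6.8), yields (6.11). □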

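The following two propositions are essential in our proof of Theorem 5.1.

Proposition 6.2. Fix $0 < \varepsilon < 1$. Then, for any sequences $\delta_i, h_i \to 0$ with $h_i/\delta_i \to 0$, we have

\[ Z_{\varepsilon,\delta_i,h_i} \to \{x_0\} \quad\text{as } i \to \infty. \]

Proof. Fix arbitrary sequences $\delta_i, h_i/\delta_i \to 0$, and denote

\[ Z_\infty = \lim Z_{\varepsilon,\delta_i,h_i} = \{ z_\infty \in M \mid \text{there exists a sequence } z_i \in Z_{\varepsilon,\delta_i,h_i} \text{ with } z_i \to z_\infty \in M \}. \]

By continuity, $z_\infty \in Z_{\varepsilon,0,0}$ for each $z_\infty \in Z_\infty$, and thus by (6.11) $z_\infty^\bullet = x_0^\bullet$.

To show $z_\infty^{\bullet\bullet} = x_0^{\bullet\bullet}$, we first let $\delta > 0$ be sufficiently small and fix a small (closed) neighborhood, say $U$, of $x_0$ so that all the derivatives (up to the second order) of the function $U \times [0,1] \ni (x,t) \mapsto c(x,\bar x_{\varepsilon,t\delta})$ are uniformly bounded. Then, for $x \in Z_{\varepsilon,\delta,h} \cap U$ the following inequalities hold:

\begin{align*}
h &\ge \phi(x) - \phi(x_0) + c^\bullet(x^\bullet,\bar x^\bullet_\varepsilon) - c^\bullet(x_0^\bullet,\bar x^\bullet_\varepsilon) + c^{\bullet\bullet}(x^{\bullet\bullet},\bar x^{\bullet\bullet}_\delta) - c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x^{\bullet\bullet}_\delta) \\
&\ge -c^\bullet(x^\bullet,\bar x_0^\bullet) + c^\bullet(x_0^\bullet,\bar x_0^\bullet) - c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet}) + c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}) \\
&\qquad + c^\bullet(x^\bullet,\bar x^\bullet_\varepsilon) - c^\bullet(x_0^\bullet,\bar x^\bullet_\varepsilon) + c^{\bullet\bullet}(x^{\bullet\bullet},\bar x^{\bullet\bullet}_\delta) - c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x^{\bullet\bullet}_\delta) && \text{(since $\bar x_0 \in \partial^c\phi(x_0)$)} \\
&\ge -c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet}) + c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}) + c^{\bullet\bullet}(x^{\bullet\bullet},\bar x^{\bullet\bullet}_\delta) - c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x^{\bullet\bullet}_\delta) && \text{(by (6.8))} \\
&\ge \delta \Bigl\langle D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet}) - D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}),\ \frac{d}{dt}\Big|_{t=0} \bar x^{\bullet\bullet}_t \Bigr\rangle + O(\delta^2) \\
&= \delta\, \nabla L\bigl( D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet}) - D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(x_0^{\bullet\bullet},\bar x_0^{\bullet\bullet}) \bigr) + O(\delta^2) && \text{(by (6.6)).}
\end{align*}

Use the coordinate $q^{\bullet\bullet}(x^{\bullet\bullet}) = -D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(x^{\bullet\bullet},\bar x_0^{\bullet\bullet})$ to rewrite this as

\[ \nabla L(q^{\bullet\bullet} - q_0^{\bullet\bullet}) \ge -\frac{h}{\delta} - O(\delta). \]

Since $L(q_0^{\bullet\bullet}) = 0$ this gives

\[ L(q^{\bullet\bullet}) \ge -\frac{h}{\delta} - O(\delta). \]

Consider now the sequences $\delta_i, h_i/\delta_i \to 0$, and any convergent subsequence of $z_i \in Z_{\varepsilon,\delta_i,h_i} \cap U$. For the limit $z_\infty$, let $q_\infty^{\bullet\bullet} = -D_{\bar x^{\bullet\bullet}} c^{\bullet\bullet}(z_\infty^{\bullet\bullet},\bar x_0^{\bullet\bullet})$. Then $q_\infty^{\bullet\bullet} \in \tilde S^{\bullet\bullet}$ (since $z_\infty \in Z_{\varepsilon,0,0} \subset S(\bar x_0)$ by (6.11)), and from the above inequality we get

\[ L(q_\infty^{\bullet\bullet}) \ge 0, \]

which forces $q_\infty^{\bullet\bullet} = q_0^{\bullet\bullet}$ by (6.2). This shows $z_\infty^{\bullet\bullet} = x_0^{\bullet\bullet}$, and thus $Z_\infty \cap U = \{x_0\}$. To finish the proof notice that each $Z_{\varepsilon,\delta,h}$ is path connected and so is the limit $Z_\infty$. (This path-connectivity can be seen by noticing that the set $Z_{\varepsilon,\delta,h}$ is convex in the coordinates $q(x) = -D_{\bar x} c(x,\bar x_{\varepsilon,\delta}) \in T^*_{\bar x_{\varepsilon,\delta}}\bar M$.) Therefore $Z_\infty = \{x_0\}$, as desired. □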

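Proposition 6.3. There exists $\delta_0 = \delta_0(\varepsilon) > 0$ such that, if $0 < \delta \le \delta_0$, $0 < h \le \delta^2$, then for each $\bar y = (\bar y',\bar y'') \in \partial^c\phi(Z_{\varepsilon,\delta,h})$ the component $\bar y''$ stays away from the cut-locus of the component $z''$ of $z$ (i.e., $\bar y'' \in \bar M''(z'')$) for every $z \in Z_{\varepsilon,\delta,h}$. Equivalently $\pi''(\partial^c\phi(Z_{\varepsilon,\delta,h})) \subset \bigcap_{z\in Z_{\varepsilon,\delta,h}} \bar M''(z'')$.

Proof. Suppose the statement is false along some sequence $\delta_i, h_i \to 0$ with $h_i \le \delta_i^2$, and let $x_i, z_i \in Z_{\varepsilon,\delta_i,h_i}$, $\bar y_i \in \partial^c\phi(x_i)$ be such that $\bar y_i'' \in \mathrm{Cut}(z_i'')$. Since $Z_{\varepsilon,\delta_i,h_i} \to \{x_0\}$, both $x_i, z_i \to x_0$. Moreover if $\bar y_\infty$ is a cluster point for $\{\bar y_i\}_{i\in\mathbb{N}}$, then $\bar y_\infty \in \partial^c\phi(x_0)$ and $\bar y_\infty'' \in \mathrm{Cut}(x_0'')$. This contradicts the choice of $M''$ (see (6.5)) and concludes the proof. □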

6.3 An Alexandrov type estimate near the cut-exposed point 

We state the main theorem for the rest of the paper.

Theorem 6.4 (Alexandrov lower bound near cut-exposed point). Fix $0 < \varepsilon < 1$, and let $Z_{\varepsilon,\delta,h}$ be as in (6.9). There exists $\delta_1 = \delta_1(\varepsilon) > 0$ so that, if $0 < \delta \le \delta_1$, then there exists $h_1 = h_1(\varepsilon,\delta)$ such that

\[ h^{\dim M} \lesssim \varepsilon^{a_0}\, |Z_{\varepsilon,\delta,h}|\, |\partial^c\phi(Z_{\varepsilon,\delta,h})| \qquad \forall\, 0 < h \le h_1(\varepsilon,\delta), \tag{6.12} \]

where $\lesssim$ is independent of $\varepsilon$, $\delta$ and $h$.

This result concludes the proof of Theorem 5.1, since for $\varepsilon > 0$ small enough and $\delta, h/\delta \to 0$ we have $Z_{\varepsilon,\delta,h} \to \{x_0\}$ (by Proposition 6.2), and (6.12) is in contradiction with (4.1).

The following subsections are devoted to the proof of Theorem 6.4, which we divide into three parts. First, in Section 6.4 we get Alexandrov type estimates for the sets obtained by the intersection of $Z_{\varepsilon,\delta,h}$ with the cut-locus components of $x_0$. In Section 6.5 we analyze the projection $\pi''(Z_{\varepsilon,\delta,h})$ of $Z_{\varepsilon,\delta,h}$ onto the regular component $M''$ of $x_0$. We construct a suitable convex set, say $C$, which has size comparable to the image $\partial^c\phi(Z_{\varepsilon,\delta,h})$, and we get a version of the estimate (6.12) involving $C$ and $\pi''(Z_{\varepsilon,\delta,h})$ (see Proposition 6.8(3)). Finally in Section 6.6 we combine these results and conclude the proof.

6.4 Proof of Theorem 6.4 (Alexandrov lower bound near cut-exposed point): analysis in the cut-locus component M'

The main result of this section is Proposition 6.7, which gives an Alexandrov type estimate for the intersection of $Z_{\varepsilon,\delta,h}$ with the cut-locus components of $x_0$. We start with a few elementary results.

Lemma 6.5. Let $\mathbb{S}^n$ be the standard round sphere, and $c(x,\bar x) = f(\mathrm{dist}(x,\bar x))$ for $(x,\bar x) \in \mathbb{S}^n \times \mathbb{S}^n$, where $f$ is a smooth strictly increasing function $f : \mathbb{R}_+ \to \mathbb{R}_+$. Assume that $c$ satisfies Assumption 2.5 (DASM$^+$). Then, for every $x, \bar x \in \mathbb{S}^n$,

\[ -c(-\bar x,\bar y) + c(-\bar x,\bar x) \ge -c(x,\bar y) + c(x,\bar x) \qquad \forall\, \bar y \in \mathbb{S}^n, \]

where $-\bar x$ denotes the antipodal point of $\bar x$. Moreover equality holds if and only if $\bar y = \bar x$.

Proof. For any $x, \bar x \in \mathbb{S}^n$, one can find a c-segment $x(s)$ with respect to $\bar x$ such that $x(0) = x(1) = -\bar x$ and $x(s_0) = x$ for some $s_0 \in [0,1]$. The inequality (together with the characterization of the equality case) then follows from (DASM$^+$) for the function $m_s(\cdot) = -c(x(s),\cdot) + c(x(s),\bar x)$. □
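
For the model cost $c = \mathrm{dist}^2/2$ on the unit sphere $\mathbb{S}^2$, the inequality of Lemma 6.5 can be tested numerically. The sketch below is only a sanity check under that assumption, not a proof; the helper names (`lemma_gap`, `random_point`, and so on) are introduced here for illustration. It evaluates the difference between the two sides, which should be nonnegative and vanish when $\bar y = \bar x$.

```python
import math
import random


def sphere_dist(x, y):
    """Geodesic distance on the unit sphere (points are unit coordinate tuples)."""
    dot = sum(a * b for a, b in zip(x, y))
    return math.acos(max(-1.0, min(1.0, dot)))


def cost(x, y):
    """Model cost c = dist^2 / 2."""
    return 0.5 * sphere_dist(x, y) ** 2


def antipode(x):
    return tuple(-a for a in x)


def lemma_gap(x, xbar, ybar):
    """Left-hand side minus right-hand side of the inequality in Lemma 6.5."""
    lhs = -cost(antipode(xbar), ybar) + cost(antipode(xbar), xbar)
    rhs = -cost(x, ybar) + cost(x, xbar)
    return lhs - rhs


def random_point(rng, dim=3):
    """Uniformly distributed point on the unit sphere in R^dim."""
    while True:
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > 1e-12:
            return tuple(a / norm for a in v)


if __name__ == "__main__":
    rng = random.Random(0)
    gaps = [lemma_gap(random_point(rng), random_point(rng), random_point(rng))
            for _ in range(1000)]
    print(min(gaps))  # expected to be nonnegative up to rounding
```

For this particular cost the inequality reduces to $\pi u - u^2/2 \ge (t^2 - s^2)/2$ with $u = \mathrm{dist}(\bar x,\bar y)$, $t = \mathrm{dist}(x,\bar x)$, $s = \mathrm{dist}(x,\bar y)$, which follows from $t - s \le u$ and $t + s \le 2\pi - u$.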

For each $1 \le i \le k$ and $z \in M$, let $M^i_z$ denote the $i$-th slice of $M$ through $z$, that is

\[ M^i_z := \{ x \in M \mid x^j = z^j \text{ for } j \ne i \}. \]

The following lemma generalizes the fact that on $M = \bar M = \mathbb{S}^n$ with $c = \mathrm{dist}^2/2$, if $x \in \mathbb{S}^n$ and $-x \in \partial^c\phi(x)$, then $\partial^c\phi(x) = \mathbb{S}^n$.

Lemma 6.6. Let $M$, $\bar M$, $c$ be as in Theorem 5.1. Let $\phi$ be a c-convex function on $M$. Fix $z = (z^1,\dots,z^k) \in M = M^1 \times \dots \times M^k$ and an open set $U$ with $z \in U$. Fix $i \in \{1,\dots,k\}$, and let $\bar z \in \bar M$ with $\bar z^i = -z^i$. The following holds:

(1) If $\bar z \in [\partial^c\phi(U)]_z$ (resp. $\bar z \in \partial^c\phi(z)$), then $\bar M^i_{\bar z} \subset [\partial^c\phi(U)]_z$ (resp. $\bar M^i_{\bar z} \subset \partial^c\phi(z)$).

(2) Suppose $\bar z \in \partial^c\phi(z)$. Then, for each $x \in M^i_z$, $\phi(x) - \phi(z) = -c^i(x^i,-z^i) + c^i(z^i,-z^i)$.

Proof. To prove (1) it is enough to observe that for $\bar x \in \bar M^i_{\bar z}$ and $x \in M$,

\begin{align*}
-c(x,\bar x) + c(z,\bar x)
&= -c^i(x^i,\bar x^i) + c^i(z^i,\bar x^i) + \sum_{j\ne i} \bigl[ -c^j(x^j,\bar z^j) + c^j(z^j,\bar z^j) \bigr] \\
&\le -c^i(x^i,-z^i) + c^i(z^i,-z^i) + \sum_{j\ne i} \bigl[ -c^j(x^j,\bar z^j) + c^j(z^j,\bar z^j) \bigr] && \text{(by Lemma 6.5)} \\
&= -c(x,\bar z) + c(z,\bar z) && \text{(since $\bar z^i = -z^i$).}
\end{align*}

The last line is bounded from above by $\phi(x) - \phi(z)$ either if $x \in \partial U$ or $\bar z \in \partial^c\phi(z)$.

Let us prove (2). Suppose $\bar z \in \partial^c\phi(z)$. By duality (Lemma 2.6), $z \in \partial^{c^*}\phi^{c^*}(\bar z)$ for the dual c-convex function $\phi^{c^*}$. Applying (1) to $\phi^{c^*}$ we get $M^i_z \subset \partial^{c^*}\phi^{c^*}(\bar z)$, or equivalently $M^i_z \subset S(\bar z)$. Therefore for all $x \in M^i_z$ we have

\begin{align*}
\phi(x) - \phi(z) &= -c(x,\bar z) + c(z,\bar z) \\
&= -c^i(x^i,-z^i) + c^i(z^i,-z^i) && \text{(since $x^j = z^j$ for $j \ne i$),}
\end{align*}

which concludes the proof. □

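Let $i \in \{1,\dots,b_0\}$, i.e., $M^i$ is a component of $M'$. Recall that $x_0$ is the cut-exposed point defined in (6.3). By definition of $b_0$ in (6.4) and (6.5), there exists $\bar y_i \in \partial^c\phi(x_0)$ such that $\bar y_i^i = -x_0^i$. (If $i \le a_0$ then one can choose $\bar y_i = \bar x_0$.) Let $Z^i_{\varepsilon,\delta,h} := \pi^i(Z_{\varepsilon,\delta,h} \cap M^i_{x_0})$ for the canonical projection $\pi^i : M \to M^i$. Then Lemma 6.6(2) implies

\[ Z^i_{\varepsilon,\delta,h} = \{ x^i \in M^i \mid -c^i(x^i,-x_0^i) + c^i(x_0^i,-x_0^i) + c^i(x^i,\bar x^i_{\varepsilon,\delta}) - c^i(x_0^i,\bar x^i_{\varepsilon,\delta}) \le h \}. \tag{6.13} \]

Here comes the main result of this section.

Proposition 6.7. There exists $\delta_2 = \delta_2(\varepsilon) > 0$ such that, if $0 < \delta \le \delta_2$, then there exists $h_2 = h_2(\varepsilon,\delta)$ such that the set $Z^i_{\varepsilon,\delta,h}$ satisfies the following estimates for $0 < h \le h_2$:

\[ h^{\dim M^i} \lesssim \varepsilon\, |Z^i_{\varepsilon,\delta,h}| \quad\text{if } 1 \le i \le a_0, \]
\[ h^{\dim M^i} \lesssim |Z^i_{\varepsilon,\delta,h}| \quad\text{if } a_0+1 \le i \le b_0, \]

where $\lesssim$ is independent of $\varepsilon$, $\delta$ and $h$, and $|Z^i_{\varepsilon,\delta,h}|$ denotes the Riemannian volume in the submanifold $M^i$.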

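Proof. From (6.13) and Lemma 6.5 we have $Z^i_{\varepsilon,\delta,h} \to \{x_0^i\}$ as $h \to 0$. Thus for sufficiently small $h$ we can embed $Z^i_{\varepsilon,\delta,h}$ into $T^*_{\bar x^i_{\varepsilon,\delta}}\bar M^i$ by $x^i \mapsto q^i(x^i) = -D_{\bar x^i} c^i(x^i,\bar x^i_{\varepsilon,\delta})$. Let $W^i_h$ be its image. Then

\[ |W^i_h| \le \Bigl( \max_{x^i \in Z^i_{\varepsilon,\delta,h}} \bigl|\det\bigl(-D_{x^i} D_{\bar x^i} c^i(x^i,\bar x^i_{\varepsilon,\delta})\bigr)\bigr| \Bigr)\, |Z^i_{\varepsilon,\delta,h}| \lesssim |Z^i_{\varepsilon,\delta,h}| \]

for $h$ sufficiently small. In the following we bound $|W^i_h|$ from below.

Without loss of generality, assume $M^i$ is the unit sphere. Let $q_0^i = q^i(x_0^i)$. By abuse of notation use $c^i(q^i,\bar x^i)$ to denote $c^i(x^i(q^i),\bar x^i)$, and renormalize this cost function as

\[ c^i_h(q^i,\bar x^i) := \frac{1}{h}\bigl[ c^i(h q^i + q_0^i,\bar x^i) - c^i(q_0^i,\bar x^i) \bigr]. \]

Then (6.13) implies $W^i_h = h\,\tilde W^i_h + q_0^i$, where

\[ \tilde W^i_h := \{ q^i \in T^*_{\bar x^i_{\varepsilon,\delta}}\bar M^i \mid -c^i_h(q^i,-x_0^i) + c^i_h(q^i,\bar x^i_{\varepsilon,\delta}) \le 1 \}. \]

Recall $c^i = f^i(\mathrm{dist}_i)$ for some smooth nonnegative uniformly convex function $f^i : \mathbb{R}_+ \to \mathbb{R}_+$ such that $f^i(0) = 0$, $\frac{df^i}{dt}(0) = 0$. Thus, as $h \to 0$ the renormalized cost $-c^i_h(q^i,-x_0^i)$ converges to the conical function

\[ \frac{df^i}{dt}(\pi)\,|q^i| \qquad\text{for } q^i \in T^*_{\bar x^i_{\varepsilon,\delta}}\bar M^i. \]

(Here, we used $\mathrm{dist}_i(x_0^i,-x_0^i) = \pi$.)

Case I: If $1 \le i \le a_0$, then $\bar x^i_{\varepsilon,\delta} = \bar x^i_\varepsilon$, and so $c^i_h(q,\bar x^i_{\varepsilon,\delta})$ converges to the linear function $q \mapsto \langle D_q c^i(q_0^i,\bar x^i_\varepsilon),\, q \rangle$, where

\[ |D_q c^i(q_0^i,\bar x^i_\varepsilon)| = \frac{df^i}{dt}(\pi - 2\pi\varepsilon) \ge \frac{df^i}{dt}(\pi) - C\varepsilon \]

for some constant $C > 0$. (Here, we used $\mathrm{dist}(x_0^i,\bar x^i_\varepsilon) = \pi - 2\pi\varepsilon$.) Therefore in the limit $h \to 0$ one can easily check that

\[ \lim_{h\to 0} |\tilde W^i_h| \gtrsim \frac{1}{\varepsilon}, \]

and thus for $h > 0$ sufficiently small

\[ |W^i_h| = h^{\dim M^i}\, |\tilde W^i_h| \gtrsim \frac{h^{\dim M^i}}{\varepsilon}. \]

Figure 3: If $1 \le i \le a_0$ then $-x_0^i = \bar x_0^i$ and $\mathrm{dist}_i(-x_0^i,\bar x^i_{\varepsilon,\delta}) \approx \varepsilon$, so the size of the section is of order $1/\varepsilon$ (see also Figure 1). On the other hand, if $a_0 < i \le b_0$ then $-x_0^i \ne \bar x_0^i$, which implies that $\mathrm{dist}_i(-x_0^i,\bar x^i_{\varepsilon,\delta})$ is uniformly bounded away from 0, and the size of the section is of order 1.

Case II: If $a_0 < i \le b_0$, then $\bar x^i_{\varepsilon,\delta} = \bar x^i_\delta$, the corresponding component of $\bar x^{\bullet\bullet}_\delta$. Similarly as for the above case, $c^i_h(q,\bar x^i_{\varepsilon,\delta})$ converges to the linear function $q \mapsto \langle D_q c^i(q_0^i,\bar x^i_\delta),\, q \rangle$. Since $\bar x^{\bullet\bullet}_\delta \in \bar M^{\bullet\bullet}(x_0^{\bullet\bullet})$ for $\delta > 0$ small enough, there exist positive constants $C$, $C_0$ such that

\[ |D_q c^i(q_0^i,\bar x^i_\delta)| \le \frac{df^i}{dt}(\pi - C) \le \frac{df^i}{dt}(\pi) - C_0, \]

where for the last inequality we used the uniform convexity of $f^i$. From this one can check that $\lim_{h\to 0} |\tilde W^i_h| \gtrsim 1$, and thus for sufficiently small $h > 0$

\[ |W^i_h| \gtrsim h^{\dim M^i}. \]

This concludes the proof.
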
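□

6.5 Proof of Theorem 6.4 (Alexandrov lower bound near cut-exposed point): analysis in the regular component M''

The main result of this subsection is Proposition 6.8. Fix $0 < \varepsilon < 1$, and assume that $\delta$ and $h/\delta$ are sufficiently small so that, as in Proposition 6.3, the set $Z_{\varepsilon,\delta,h}$ is close to the cut-exposed point $x_0$, and so in particular $Z_{\varepsilon,\delta,h} \subset M(\bar x_{\varepsilon,\delta})$. Consider the change of coordinates $q \in T^*_{\bar x_{\varepsilon,\delta}}\bar M \mapsto x(q) \in M(\bar x_{\varepsilon,\delta})$ induced by the relation

\[ q = -D_{\bar x} c(x(q),\bar x_{\varepsilon,\delta}), \tag{6.14} \]

and let $\tilde Z_{\varepsilon,\delta,h} \subset T^*_{\bar x_{\varepsilon,\delta}}\bar M$ be the set $Z_{\varepsilon,\delta,h}$ in this chart. The function $\phi$ and the cost $c$ are transformed to

\[ \varphi(q) := \phi(x(q)) + c(x(q),\bar x_{\varepsilon,\delta}), \]

and

\[ \tilde c(q,\bar y) := c(x(q),\bar y) - c(x(q),\bar x_{\varepsilon,\delta}) \qquad\text{for } (q,\bar y) \in T^*_{\bar x_{\varepsilon,\delta}}\bar M \times \bar M. \]

Notice that

\[ \tilde c(q,\bar x_{\varepsilon,\delta}) \equiv 0, \]

and $\varphi$ is a $\tilde c$-convex function on $T^*_{\bar x_{\varepsilon,\delta}}\bar M$. Moreover

\[ \tilde Z_{\varepsilon,\delta,h} = \{ q \in T^*_{\bar x_{\varepsilon,\delta}}\bar M \mid \varphi(q) - \varphi(q_0) \le h \}, \]

where $q_0$ is the point corresponding to $x_0$ in this new chart. It is important to recall that, thanks to Assumption 2.4 (convex DASM), $q \mapsto -\tilde c(q,\bar y)$ and $\varphi$ are convex. (See Section 3.2.)

We have the natural decomposition (with obvious notation)

\[ q = (q',q'') = \bigl( -D_{\bar x'} c'(x'(q'),\bar x'_{\varepsilon,\delta}),\ -D_{\bar x''} c''(x''(q''),\bar x''_{\varepsilon,\delta}) \bigr) \in T^*_{\bar x_{\varepsilon,\delta}}\bar M = T^*_{\bar x'_{\varepsilon,\delta}}\bar M' \times T^*_{\bar x''_{\varepsilon,\delta}}\bar M''. \tag{6.15} \]

(Here, one should keep in mind that, by the definition of $\bar x_{\varepsilon,\delta}$, the component $\bar x''_{\varepsilon,\delta}$ in $\bar M''$ does not depend on $\varepsilon$.) The modified cost $\tilde c(q,\bar y)$ has the decomposition

\[ \tilde c(q,\bar y) = \tilde c'(q',\bar y') + \tilde c''(q'',\bar y''), \]

where

\[ \tilde c'(q',\bar y') = c'(x'(q'),\bar y') - c'(x'(q'),\bar x'_{\varepsilon,\delta}) \qquad\text{for } q' \in T^*_{\bar x'_{\varepsilon,\delta}}\bar M', \]

and

\[ \tilde c''(q'',\bar y'') = c''(x''(q''),\bar y'') - c''(x''(q''),\bar x''_{\varepsilon,\delta}) \qquad\text{for } q'' \in T^*_{\bar x''_{\varepsilon,\delta}}\bar M''. \]

Let $\pi'$, $\pi''$ denote the canonical projections from $T^*_{\bar x_{\varepsilon,\delta}}\bar M$ onto $T^*_{\bar x'_{\varepsilon,\delta}}\bar M'$ and $T^*_{\bar x''_{\varepsilon,\delta}}\bar M''$, respectively.

Now, let us construct a convex set $C \subset T^*_{q_0}(T^*_{\bar x_{\varepsilon,\delta}}\bar M)$ that we will use later to estimate $|\partial^c\phi(Z_{\varepsilon,\delta,h})|$ from below (see Proposition 6.9). The strategy of the proof follows the lines of [FKM1, Proposition 7.6].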

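Proposition 6.8. Fix $0 < \varepsilon < 1$, and assume that $0 < \delta \le \delta_0$ and $0 < h \le \delta^2$, with $\delta_0$ as in Proposition 6.3. Then there exists a convex set $C \subset T^*_{q_0}(T^*_{\bar x_{\varepsilon,\delta}}\bar M)$ satisfying the following properties:

(1) $C \subset \{0\} \times T^*_{q_0''}(T^*_{\bar x''_{\varepsilon,\delta}}\bar M'') \subset T^*_{q_0'}(T^*_{\bar x'_{\varepsilon,\delta}}\bar M') \times T^*_{q_0''}(T^*_{\bar x''_{\varepsilon,\delta}}\bar M'')$;

(2) $\tilde c\text{-Exp}_{q_0} C = \{ \bar z \in \bar M \mid -\partial_q \tilde c(q_0,\bar z) \cap C \ne \emptyset \} \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0} \subset \partial^c\phi(Z_{\varepsilon,\delta,h})$, where $\partial_q$ denotes the subdifferential with respect to the $q$ variable;

(3) $\mathscr{H}^{n''}(C)\, \mathscr{H}^{n''}(\pi''(\tilde Z_{\varepsilon,\delta,h})) \gtrsim h^{n''}$, where $\gtrsim$ is independent of $h$, $\delta$, $\varepsilon$.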

Proof. In the following, we first construct such a set $C$ and then we show the desired properties. The set $C$ will be given as a convex hull of certain covectors $\bar p_1, \dots, \bar p_{n''}$; see (6.22). We go through several steps.

First we find some auxiliary covectors $p_1, \dots, p_{n''}$. From Lemma 3.1 applied to the convex set $\pi''(\tilde Z_{\varepsilon,\delta,h})$, there is an ellipsoid $E$ such that

$$E \subset \pi''(\tilde Z_{\varepsilon,\delta,h}) \subset n'' E, \tag{6.16}$$

where the scaling $n''E$ is with respect to the barycenter of the ellipsoid. Let $p_i''$, $1 \le i \le n''$, denote the unit orthogonal covectors parallel to the axes of the ellipsoid $E$, and denote by $a_i$ the length of the $i$-th principal axis of $E$. Find hyperplanes $\Pi_i'' \subset T^*_{x''_{\delta}} M''$ that are orthogonal to $p_i''$ and touch tangentially the boundary of $\pi''(\tilde Z_{\varepsilon,\delta,h})$ at points $q_i''$, $1 \le i \le n''$. Let $q_0''$ be the point in $T^*_{x''_{\delta}} M''$ corresponding to $x_0''$, and denote by $l_i$ the distance from $q_0''$ to $\Pi_i''$. Then, thanks to (6.16) we have

$$\prod_{i=1}^{n''} l_i \le \prod_{i=1}^{n''} (2 n'' a_i) \lesssim \mathscr{H}^{n''}(\pi''(\tilde Z_{\varepsilon,\delta,h})). \tag{6.17}$$
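
The two inequalities in (6.17) can be unpacked as follows. This is only a sketch under two assumptions that are not spelled out in the text: that $a_i$ denotes the semi-axis length of $E$ (so that the factor $2$ accounts for the full width), and that $\omega_{n''}$ denotes the volume of the unit ball in dimension $n''$ (a symbol introduced here for illustration).

```latex
% q_0'' lies in \pi''(\tilde Z_{\varepsilon,\delta,h}) \subset n''E, and the
% touching point q_i'' \in \Pi_i'' lies in the closure of the same set, so the
% distance from q_0'' to \Pi_i'' is at most the width of n''E in direction p_i'':
l_i = \operatorname{dist}(q_0'', \Pi_i'') \le 2 n'' a_i .

% Taking the product and using E \subset \pi''(\tilde Z_{\varepsilon,\delta,h}):
\prod_{i=1}^{n''} l_i
  \le (2 n'')^{n''} \prod_{i=1}^{n''} a_i
  = \frac{(2 n'')^{n''}}{\omega_{n''}} \, \mathscr{H}^{n''}(E)
  \le \frac{(2 n'')^{n''}}{\omega_{n''}} \,
      \mathscr{H}^{n''}\bigl(\pi''(\tilde Z_{\varepsilon,\delta,h})\bigr) .
```

The implied constant depends only on $n''$, which is why the estimate is uniform in $\varepsilon$, $\delta$ and $h$.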

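For each $i$, there exists $q_i' \in T^*_{x'_{\varepsilon,\delta}} M'$ such that the hyperplane $\Pi_i := T^*_{x'_{\varepsilon,\delta}} M' \times \Pi_i'' \subset T^*_{x_{\varepsilon,\delta}} M$ tangentially touches the boundary $\partial \tilde Z_{\varepsilon,\delta,h}$ at the point $q_i = (q_i', q_i'')$. Let $x_i = c\text{-Exp}_{x_{\varepsilon,\delta}}\, q_i$. Since $p_i = (0, p_i'')$ is orthogonal to $\Pi_i$ and $\tilde Z_{\varepsilon,\delta,h}$ is a sublevel set of the convex function $\tilde\phi$, there exists a scalar multiple $t_i > 0$ such that $t_i p_i \in \partial\tilde\phi(q_i)$. By Assumption 2.2 and Loeper's maximum principle (DASM) (Lemma 2.7), the point $z_i = \tilde c\text{-Exp}_{q_i}\, t_i p_i$ satisfies $z_i \in \partial^{\tilde c}\tilde\phi(q_i) = \partial^c\phi(x_i)$. Note that in fact,

$$z_i = \tilde c\text{-Exp}_{q_i}\, t_i p_i = c\text{-Exp}_{x_i}\, \eta(t_i p_i),$$

where $\eta$ is the affine map given by Lemma 3.4 (in whose statement we replace $x_0$, $q_0$ and $\bar y_0$ with $x_i$, $q_i$ and $x_{\varepsilon,\delta}$, respectively). Moreover, using the decomposition

$$p_i = (0, p_i'') \in T^*_{q_i'}(T^*_{x'_{\varepsilon,\delta}} M') \times T^*_{q_i''}(T^*_{x''_{\delta}} M''),$$

we see that the $\tilde c$-segment (with respect to $q_i$)

$$[0,1] \ni t \mapsto z_i(t) = \tilde c\text{-Exp}_{q_i}\bigl((1-t)\, t_i p_i\bigr) = c\text{-Exp}_{x_i}\bigl(\eta((1-t)\, t_i p_i)\bigr)$$

from $z_i(0) = z_i$ to $z_i(1) = x_{\varepsilon,\delta}$ is of the form

$$z_i(t) = (x'_{\varepsilon,\delta}, z_i''(t)) \in M' \times M''.$$

Observe that by Proposition 6.3 and Assumption 2.3 we have

$$z_i''(t) \in M''(x''), \quad \forall\, t \in [0,1],\ \forall\, x \in Z_{\varepsilon,\delta,h}. \tag{6.18}$$

We use these $\tilde c$-segments $z_i(t)$ to define the points $\bar p_i$, $i = 1, \dots, n''$. Define the function

$$m_{z_i(t)}(q) := -\tilde c(q, z_i(t)) + \tilde c(q_i, z_i(t)) + \tilde\phi(q_i).$$

Clearly, $m_{z_i(0)} \le \tilde\phi$ and $m_{z_i(1)} \equiv \tilde\phi(q_i) = h + \tilde\phi(q_0)$. By continuity there exists $\tau_i \in [0,1)$ such that

$$m_{z_i(\tau_i)}(q_0) = \tilde\phi(q_0).$$

Figure 4: The supporting function $m_{z_i(0)} = m_{z_i}$ touches $\tilde\phi$ at $q_i$ from below. By interpolating between $m_{z_i} = m_{z_i(0)}$ and $m_{x_{\varepsilon,\delta}} = m_{z_i(1)}$ along the $\tilde c$-segment with respect to $q_i$, we can find $\tau_i \in [0,1)$ such that $m_{z_i(\tau_i)}(q_0) = \tilde\phi(q_0)$. Then the covector $\bar p_i$ used to construct $C$ is defined as $\bar p_i := (0, -D_{q''}\tilde c''(q_0'', z_i''(\tau_i))) \in \partial m_{z_i(\tau_i)}(q_0)$.

Also, Loeper's maximum principle (DASM) implies

$$m_{z_i(\tau_i)} \le \max[\,h + \tilde\phi(q_0),\ \tilde\phi\,],$$

so that in particular

$$m_{z_i(\tau_i)} \le \tilde\phi \quad \text{on } \partial\tilde Z_{\varepsilon,\delta,h},$$

hence, by the definition of $[\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}$,

$$z_i(\tau_i) \in [\partial^{\tilde c}\tilde\phi(\tilde Z_{\varepsilon,\delta,h})]_{q_0} = [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0} \quad \text{for every } i = 1, \dots, n''. \tag{6.19}$$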
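For later use, consider the nonzero vectors

$$p_i(\tau_i) = (1-\tau_i)\, t_i p_i = (0, (1-\tau_i)\, t_i p_i'') = \bigl(0, -D_{q''}\tilde c''(q_i'', z_i''(\tau_i))\bigr) \in T^*_{q_i'}(T^*_{x'_{\varepsilon,\delta}} M') \times T^*_{q_i''}(T^*_{x''_{\delta}} M''), \quad i = 1, \dots, n''. \tag{6.20}$$

Clearly these vectors are all mutually orthogonal. Moreover, because

$$p_i(\tau_i) \in \partial m_{z_i(\tau_i)}(q_i), \quad i = 1, \dots, n'',$$

we have by the convexity of $m_{z_i(\tau_i)}$,

$$|p_i(\tau_i)| \ge \frac{\tilde\phi(q_i) - \tilde\phi(q_0)}{\operatorname{dist}(q_0, \Pi_i)} = \frac{h}{l_i}. \tag{6.21}$$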
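
For the reader's convenience, here is one way to see how (6.21) follows from convexity. This is a sketch in the notation above, writing $m := m_{z_i(\tau_i)}$ for brevity; it uses only that $m(q_i) = \tilde\phi(q_i)$ by definition, that $m(q_0) = \tilde\phi(q_0)$ by the choice of $\tau_i$, and that $p_i(\tau_i)$ is normal to the hyperplane $\Pi_i$ through $q_i$.

```latex
% Subgradient inequality for the convex function m at q_i:
\tilde\phi(q_0) = m(q_0)
  \ge m(q_i) + \langle p_i(\tau_i),\, q_0 - q_i \rangle
  = \tilde\phi(q_i) + \langle p_i(\tau_i),\, q_0 - q_i \rangle .

% Since p_i(\tau_i) is normal to \Pi_i and q_i \in \Pi_i,
% |\langle p_i(\tau_i), q_i - q_0 \rangle| = |p_i(\tau_i)| \operatorname{dist}(q_0, \Pi_i). Hence
h = \tilde\phi(q_i) - \tilde\phi(q_0)
  \le \langle p_i(\tau_i),\, q_i - q_0 \rangle
  \le |p_i(\tau_i)| \, \operatorname{dist}(q_0, \Pi_i)
  = |p_i(\tau_i)| \, l_i .
```

The last equality uses $\Pi_i = T^*_{x'_{\varepsilon,\delta}} M' \times \Pi_i''$, so that the distance from $q_0$ to $\Pi_i$ equals the distance $l_i$ from $q_0''$ to $\Pi_i''$.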
To finish the construction of $C$, let

$$\bar p_i := \bigl(0, -D_{q''}\tilde c''(q_0'', z_i''(\tau_i))\bigr) \in T^*_{q_0'}(T^*_{x'_{\varepsilon,\delta}} M') \times T^*_{q_0''}(T^*_{x''_{\delta}} M''), \quad i = 1, \dots, n''. \tag{6.22}$$

Notice that $z_i(\tau_i) = (x'_{\varepsilon,\delta}, z_i''(\tau_i)) = \tilde c\text{-Exp}_{q_0}\, \bar p_i$. Let $C = \operatorname{co}(\bar p_1, \dots, \bar p_{n''})$ be the convex hull of $\bar p_1, \dots, \bar p_{n''}$. In the following, we will see that $C$ satisfies the desired properties (1), (2) and (3). First, (1) follows immediately from (6.22), while (2) is a direct consequence of (6.19) and Lemma 2.8.

Now, let us show (3). By (6.18) each $z_i''(\tau_i)$ stays uniformly away (for small $\delta$, $h$) from the cut-locus of $\pi''(Z_{\varepsilon,\delta,h})$. Hence we can apply Lemma 3.3 to (6.20) and (6.22) to see that $\bar p_i$ is close to $p_i(\tau_i)$ when we use the canonical identification $T^*_{q_i}(T^*_{x_{\varepsilon,\delta}} M) \approx T^*_{q_0}(T^*_{x_{\varepsilon,\delta}} M)$; more precisely,

$$|\bar p_i - p_i(\tau_i)| \le o_h(1)\, |\bar p_i|,$$

where $o_h(1)$ is a quantity which goes to $0$ as $h \to 0$. Since the vectors $\{p_i(\tau_i)\}_{i=1,\dots,n''}$ are all mutually orthogonal, the $\bar p_i$ are almost mutually orthogonal covectors, which by (6.21) satisfy

$$|\bar p_i| \approx |p_i(\tau_i)| \gtrsim \frac{h}{l_i}.$$

(Here, for sufficiently small $\delta$ and $h$, the inequality $\gtrsim$ and the almost orthogonality are independent of $\delta$, $h$ and $\varepsilon$.) This gives

$$\mathscr{H}^{n''}(C) \gtrsim \prod_{i=1}^{n''} \frac{h}{l_i}.$$

This estimate combined with (6.17) shows (3). This completes the proof. □
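
The final step of the proof can be written out explicitly. The sketch below takes the lower bound on $\mathscr{H}^{n''}(C)$ in terms of the lengths $|\bar p_i| \gtrsim h/l_i$ as given and simply combines it with (6.17); no new ingredient is used.

```latex
% Lower bound on C coming from the almost-orthogonal covectors:
\mathscr{H}^{n''}(C) \gtrsim \prod_{i=1}^{n''} \frac{h}{l_i}
                     = \frac{h^{n''}}{\prod_{i=1}^{n''} l_i} .

% Upper bound on the product of distances, from (6.17):
\prod_{i=1}^{n''} l_i \lesssim
  \mathscr{H}^{n''}\bigl(\pi''(\tilde Z_{\varepsilon,\delta,h})\bigr) .

% Combining the two gives property (3):
\mathscr{H}^{n''}(C)\,
\mathscr{H}^{n''}\bigl(\pi''(\tilde Z_{\varepsilon,\delta,h})\bigr)
  \gtrsim \frac{h^{n''}}{\prod_{i=1}^{n''} l_i} \cdot \prod_{i=1}^{n''} l_i
  = h^{n''} .
```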

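6.6 Proof of Theorem 6.4 (Alexandrov lower bound near cut-exposed point): final argument

In this section we finish the proof of Theorem 6.4. Let $0 < \varepsilon < 1$, and fix $0 < \delta < \delta_1(\varepsilon) := \min\{\delta_0(\varepsilon), \delta_2(\varepsilon)\}$ and $0 < h < h_1(\varepsilon,\delta) := \min\{\delta^2, h_2(\varepsilon,\delta)\}$, with $\delta_0(\varepsilon)$ and $\delta_2(\varepsilon)$, $h_2(\varepsilon,\delta)$ as in Propositions 6.3 and 6.7, respectively. The estimates $\lesssim$, $\gtrsim$, $\approx$ in this section are all independent of $\varepsilon$, $\delta$ and $h$.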

To make use of the results of previous sections, we need the following comparison result:

Proposition 6.9. The set $C$ constructed in Proposition 6.8 satisfies

$$\mathscr{H}^{n''}(C) \lesssim |\partial^c\phi(Z_{\varepsilon,\delta,h})|.$$

Note that even with Proposition 6.8(2), this estimate is not obvious because $n'' < \dim M$.

Proof. For each $\varepsilon, \delta, h$ as in Proposition 6.8 we will find an auxiliary set $A = A_{\varepsilon,\delta,h}$ contained in a fixed (thus independent of $\varepsilon, \delta, h$) compact set $\mathcal{D} \subset M^*(x_0) \subset T^*_{x_0} M$ such that

$$c\text{-Exp}_{x_0}(A) \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0} \subset \partial^c\phi(Z_{\varepsilon,\delta,h}); \tag{6.23}$$

$$|A| \gtrsim \mathscr{H}^{n''}(C). \tag{6.24}$$

Once such a set is constructed, the desired estimate follows from

$$|\partial^c\phi(Z_{\varepsilon,\delta,h})| \ge |c\text{-Exp}_{x_0}(A)| \gtrsim |A| \quad (\text{since } A \subset \mathcal{D}).$$

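The construction of $A$ goes through several steps. First, apply to the set $C$ the (extended) map $p \in T^*_{q_0}(T^*_{x_{\varepsilon,\delta}} M) \mapsto \eta(p) \in T^*_{x_0} M$ as in Lemma 3.4 (with $\bar y_0 = x_{\varepsilon,\delta}$), and let $\eta(C) \subset T^*_{x_0} M$ denote its image. Notice that by Proposition 6.8(2)

$$c\text{-Exp}_{x_0}(\eta(C)) = \tilde c\text{-Exp}_{q_0}\, C \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}.$$

Let us compare $\mathscr{H}^{n''}(\eta(C))$ with $\mathscr{H}^{n''}(C)$. For each $p = (0, p'') \in C$, Lemma 3.4 applies as

$$\eta(p) = \bigl(\eta'_{\varepsilon},\ (-D_{x''}D_{\bar x''} c''(x_0'', x''_{\delta}))\, p'' + \eta''_{\delta}\bigr),$$

where $\eta_{\varepsilon,\delta} = (\eta'_{\varepsilon}, \eta''_{\delta}) = -D_x c(x_0, x_{\varepsilon,\delta})$ (thus, $c\text{-Exp}_{x_0}(\eta_{\varepsilon,\delta}) = x_{\varepsilon,\delta}$). Therefore

$$\eta(C) \subset \{\eta'_{\varepsilon}\} \times T^*_{x_0''} M''$$

and

$$\mathscr{H}^{n''}(\eta(C)) = |\det D_{x''}D_{\bar x''} c''(x_0'', x''_{\delta})|\, \mathscr{H}^{n''}(C) \approx \mathscr{H}^{n''}(C).$$

Notice that $x''_{\delta}$ is independent of $\varepsilon$ (see (6.7)) and stays uniformly away from $\operatorname{Cut}(x_0'')$, so that the above estimate is independent of $\varepsilon$, $\delta$ and $h$.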

We now use a convexity argument to construct $A$. We will first construct some suitable sets $C^1, \dots, C^{b_0}$, and $C_0$, inside a fixed compact set (independent of $\varepsilon, \delta, h$) in $M^*(x_0)$, which satisfy the properties of the sets $S_i$ in Lemma 3.3. These sets will also satisfy:

$$c\text{-Exp}_{x_0}(C^1) \cup \dots \cup c\text{-Exp}_{x_0}(C^{b_0}) \cup c\text{-Exp}_{x_0}(C_0) \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0};$$

$$\mathscr{H}^{n_i}(C^i) \gtrsim 1, \quad i = 1, \dots, b_0;$$

$$\mathscr{H}^{n''}(C_0) \gtrsim \mathscr{H}^{n''}(\eta(C)).$$

Then $A$ will be given as the convex hull of these sets, that is $A = \operatorname{co}(C^1, \dots, C^{b_0}, C_0)$. By convexity of $M^*(x_0)$, $A$ will be in a fixed compact set, say $\mathcal{D}$, independent of $\varepsilon, \delta, h$, and the $c$-convexity of $[\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}$ (see Lemma 2.8) will imply $c\text{-Exp}_{x_0}(A) \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}$, showing (6.23). We will then apply Lemma 3.3 to get

$$|A| \gtrsim \mathscr{H}^{n''}(\eta(C)) \approx \mathscr{H}^{n''}(C),$$

which gives (6.24). Hence we are left to construct $C^1, \dots, C^{b_0}, C_0$.

To construct $C^1, \dots, C^{b_0}$, recall that $M' = M^1 \times \dots \times M^{b_0}$, and for every $i \in \{1, \dots, b_0\}$ there exists $y_i \in \partial^c\phi(x_0)$ with $y_i^i = -x_0^i$. Moreover $M^i_{y_i} \subset \partial^c\phi(x_0)$ by Lemma 5.6. We further observe that the same inclusion holds for all the components $y_i^l$ of $y_i$ that satisfy $y_i^l = -x_0^l$. Hence, once $y_i$ has a cut-locus component with $x_0$, then one can change such component arbitrarily, and the resulting point still remains inside $\partial^c\phi(x_0)$. Combining this fact with Loeper's maximum principle (DASM) we can find a covector $v_i$ and a set $C^i \subset \partial\phi(x_0) \subset T^*_{x_0} M$, with $v_i \in C^i$, whose components are either $v_i^l = 0$ or $v_i^l \in M^{l*}(x_0^l)$, and

$$C^i = \{ q \in T^*_{x_0} M \mid 2q^i \in \overline{M^{i*}(x_0^i)}, \text{ and } q^l = v_i^l \text{ for } l \neq i \}.$$

Clearly, $C^i$ is compact and $C^i \subset M^*(x_0)$. Moreover $c\text{-Exp}_{x_0} C^i \subset \partial^c\phi(x_0) \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}$ and $\mathscr{H}^{n_i}(C^i) \gtrsim 1$. Also, observe that the construction of $C^1, \dots, C^{b_0}$ is independent of $\varepsilon, \delta, h$.

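Let us now construct the set $C_0$. From Propositions 6.2 and 6.3 we see that for $\delta$ and $h$ sufficiently small there exists a compact set $C'' \subset M''(x_0'')$ (independent of $\varepsilon, \delta, h$) with $\pi''(\partial^c\phi(Z_{\varepsilon,\delta,h})) \subset C''$. Recall the definition of $a_0$, $v_0$, $x_{\varepsilon,\delta} = (x'_{\varepsilon}, x''_{\delta})$, and that $x''_{\delta} \in M''(x_0'')$. Then we write $\eta_{\varepsilon,\delta} = (\eta'_{\varepsilon}, \eta''_{\delta}) \in T^*_{x_0'} M' \times T^*_{x_0''} M''$ and we observe that $\eta''_{\delta}$ is uniformly away from the boundary of $M''^*(x_0'')$. These facts imply that there exists a compact set $C_2'' \subset M''^*(x_0'')$ (independent of $\varepsilon, \delta, h$) such that

$$\eta(C) \subset \{\eta'_{\varepsilon}\} \times C_2''.$$

However, $\eta'_{\varepsilon} \to \partial M'^*(x_0')$ as $\varepsilon \to 0$, thus $\eta(C)$ is not kept in a fixed compact set in $M^*(x_0)$.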
In particular, we cannot take 77(C) for Co, and this motivates the following: Since x £ = — x 
and x £j s G [d c 4>{Z £ ^^)} Xo , applying Lemma as in the previous paragraph we see that the set 
M' x {x s }, in particular, (x ,x'g) belongs to [d°cj){Z £ ^ h)]x - This point (x ,x s ) corresponds to 
the covector (0, rj' s ). Consider the cone co ((0, 77^') , 77(C)), and define Co as 

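$$C_0 := \operatorname{co}\bigl((0, \eta''_{\delta}),\ \eta(C)\bigr) \cap \bigl\{ q = (\tfrac{1}{2}\eta'_{\varepsilon}, q'') \in T^*_{x_0} M \mid q'' \in T^*_{x_0''} M'' \bigr\}.$$

By a simple geometric argument

$$\mathscr{H}^{n''}(C_0) \gtrsim \mathscr{H}^{n''}(\eta(C)),$$

and moreover, since $\tfrac{1}{2}\eta'_{\varepsilon} \in \tfrac{1}{2} M'^*(x_0')$, the set $C_0$ is contained in a fixed compact set in $M^*(x_0)$ independently of $\varepsilon, \delta, h$. By $c$-convexity of $[\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}$,

$$c\text{-Exp}_{x_0}(C_0) \subset [\partial^c\phi(Z_{\varepsilon,\delta,h})]_{x_0}.$$

Note that by construction this set $C_0$, together with $C^1, \dots, C^{b_0}$, satisfies the property of the sets $S_i$ in Lemma 3.3. Furthermore they are in a fixed compact set in $M^*(x_0)$ independent of $\varepsilon, \delta, h$. This completes the proof. □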

Combining Propositions 6.9 and 6.8(3) we obtain

$$h^{n''} \lesssim \mathscr{H}^{n''}(\pi''(\tilde Z_{\varepsilon,\delta,h}))\, |\partial^c\phi(Z_{\varepsilon,\delta,h})|. \tag{6.25}$$
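
Spelled out, (6.25) is just the concatenation of the two propositions. The chain below is a sketch that uses nothing beyond Proposition 6.8(3) and Proposition 6.9 as stated above.

```latex
h^{n''}
  \lesssim \mathscr{H}^{n''}(C)\,
           \mathscr{H}^{n''}\bigl(\pi''(\tilde Z_{\varepsilon,\delta,h})\bigr)
           % Proposition 6.8(3)
  \lesssim |\partial^c\phi(Z_{\varepsilon,\delta,h})|\,
           \mathscr{H}^{n''}\bigl(\pi''(\tilde Z_{\varepsilon,\delta,h})\bigr) .
           % Proposition 6.9
```

Both implied constants are independent of $\varepsilon$, $\delta$ and $h$, so the same holds for (6.25).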

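We will finish the proof by applying Proposition 6.7. First, we need some preliminary steps. Use the notation given in Section 6.3. Let $Z'_{\varepsilon,\delta,h}$ be the slice of $Z_{\varepsilon,\delta,h}$ in $M' \times \{x_0''\}$, that is

$$Z'_{\varepsilon,\delta,h} := \{ x' \in M' \mid (x', x_0'') \in Z_{\varepsilon,\delta,h} \}.$$

Then $Z'_{\varepsilon,\delta,h}$ is embedded via $x' \mapsto -D_{\bar x'} c'(x', x'_{\varepsilon,\delta})$ into $\tilde Z'_{\varepsilon,\delta,h} \subset M'^*(x'_{\varepsilon,\delta})$, where

$$\tilde Z'_{\varepsilon,\delta,h} := \{ q' \in M'^*(x'_{\varepsilon,\delta}) \mid (q', q_0'') \in \tilde Z_{\varepsilon,\delta,h} \}.$$

Embed in the same way each $Z^i_{\varepsilon,\delta,h}$ (see (6.13)), $i = 1, \dots, b_0$, into $\tilde Z^i_{\varepsilon,\delta,h} \subset M^{i*}(x^i_{\varepsilon,\delta})$.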

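Proposition 6.10. Assume that $0 < \delta < \delta_0$ and $0 < h < \delta^2$, with $\delta_0$ as in Proposition 6.3. Then the following inequalities hold:

$$\Bigl(\min_{x' \in Z'_{\varepsilon,\delta,h}} |\det(D_{x'}D_{\bar x'} c'(x', x'_{\varepsilon,\delta}))|\Bigr)\, |Z'_{\varepsilon,\delta,h}| \le |\tilde Z'_{\varepsilon,\delta,h}| \le \Bigl(\max_{x' \in Z'_{\varepsilon,\delta,h}} |\det(D_{x'}D_{\bar x'} c'(x', x'_{\varepsilon,\delta}))|\Bigr)\, |Z'_{\varepsilon,\delta,h}|,$$

$$\Bigl(\min_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|\Bigr)\, |Z^i_{\varepsilon,\delta,h}| \le |\tilde Z^i_{\varepsilon,\delta,h}| \le \Bigl(\max_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|\Bigr)\, |Z^i_{\varepsilon,\delta,h}|,$$

where $|\cdot|$ denotes the Riemannian volume (in the appropriate submanifold).

Proof. From (6.14)

$$D_{x'} q' = -D_{x'}D_{\bar x'} c'(x'(q'), x'_{\varepsilon,\delta}),$$

and so the first inequality follows from

$$|\tilde Z'_{\varepsilon,\delta,h}| = \int_{Z'_{\varepsilon,\delta,h}} |\det D_{x'} q'| \, dx'.$$

The proof of the second inequality is analogous. □

By convexity and Lemma 3.3 one has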



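$$\mathscr{H}^{n'}(\tilde Z'_{\varepsilon,\delta,h}) \gtrsim \prod_{i=1}^{b_0} \mathscr{H}^{n_i}(\tilde Z^i_{\varepsilon,\delta,h}),$$

while Propositions 6.10 and 6.7 imply

$$\prod_{i=1}^{b_0} \mathscr{H}^{n_i}(\tilde Z^i_{\varepsilon,\delta,h}) \ge \prod_{i=1}^{b_0} \Bigl[\min_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|\Bigr]\, |Z^i_{\varepsilon,\delta,h}| \gtrsim \Bigl[\prod_{i=1}^{b_0} \min_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|\Bigr]\, \frac{h^{n'}}{\varepsilon^{a_0}}.$$

Combining these estimates with (6.25) we get

$$h^{n} \lesssim \frac{\varepsilon^{a_0}}{\prod_{i=1}^{b_0} \min_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|}\; \prod_{i=1}^{b_0} \mathscr{H}^{n_i}(\tilde Z^i_{\varepsilon,\delta,h})\; \mathscr{H}^{n''}(\pi''(\tilde Z_{\varepsilon,\delta,h}))\, |\partial^c\phi(Z_{\varepsilon,\delta,h})|$$

$$\lesssim \frac{\varepsilon^{a_0}}{\prod_{i=1}^{b_0} \min_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|}\; \mathscr{H}^{n'}(\tilde Z'_{\varepsilon,\delta,h})\, \mathscr{H}^{n''}(\pi''(\tilde Z_{\varepsilon,\delta,h}))\, |\partial^c\phi(Z_{\varepsilon,\delta,h})|$$

$$\lesssim \varepsilon^{a_0}\, |Z_{\varepsilon,\delta,h}|\, |\partial^c\phi(Z_{\varepsilon,\delta,h})|,$$

where the last inequality follows from

$$\mathscr{H}^{n'}(\tilde Z'_{\varepsilon,\delta,h})\, \mathscr{H}^{n''}(\pi''(\tilde Z_{\varepsilon,\delta,h}))\, |\partial^c\phi(Z_{\varepsilon,\delta,h})| \lesssim |\tilde Z_{\varepsilon,\delta,h}|\, |\partial^c\phi(Z_{\varepsilon,\delta,h})| \quad (\text{by Lemma ...}),$$

$$|\tilde Z_{\varepsilon,\delta,h}| \le \Bigl[\max_{x \in Z_{\varepsilon,\delta,h}} \det(D_x D_{\bar x} c(x, x_{\varepsilon,\delta}))\Bigr]\, |Z_{\varepsilon,\delta,h}|$$

(see Proposition 6.10) and

$$\frac{\max_{x' \in Z'_{\varepsilon,\delta,h}} \det(D_{x'}D_{\bar x'} c'(x', x'_{\varepsilon,\delta}))}{\prod_{i=1}^{b_0} \min_{x^i \in Z^i_{\varepsilon,\delta,h}} |\det(D_{x^i}D_{\bar x^i} c^i(x^i, x^i_{\varepsilon,\delta}))|} \lesssim 1 \quad \text{as } \delta, h \to 0$$

(see Propositions 6.2 and 6.3). This concludes the proof of Theorem 6.4, and Theorem 5.1 is proved.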






[Br] Y. Brenier, Polar factorization and monotone rearrangement of vector-valued functions. Comm. Pure Appl. Math., 44 (1991), 375-417.

L. A. Caffarelli, A localization property of viscosity solutions to the Monge-Ampere 
equation and their strict convexity. Ann. of Math., 131 (1990), 129-134. 

L. A. Caffarelli, Some regularity properties of solutions of Monge Ampere equation. 
Comm. Pure Appl. Math., 44 (1991), no. 8-9, 965-969. 

L. A. Caffarelli, The regularity of mapping with a convex potential. J. Amer. Math. 
Soc., 5 (1992), 99-104. 

L. A. Caffarelli, Boundary regularity of maps with convex potentials II. Ann. of Math., 
144 (1996), 453-496. 

L. A. Caffarelli, Monotonicity properties of optimal transportation and the FKG and 
related inequalities. Comm. Math. Phys., 214 (2000), no. 3, 547-563. 

D. Cordero-Erausquin, Sur le transport de mesures periodiques. C. R. Acad. Sci. Paris 
Ser. I Math., 329 (1999), 199-202. 

P. Delanoe, Classical solvability in dimension two of the second boundary value problem associated with the Monge-Ampere operator. Ann. Inst. Henri Poincare-Anal. Non Lin., 8 (1991), 443-457.

P. Delanoe, Gradient rearrangement for diffeomorphisms of a compact manifold. Diff. 
Geom. Appl., 20 (2004), 145-165. 

P. Delanoe and Y. Ge, Regularity of optimal transportation maps on compact, locally 
nearly spherical, manifolds. To appear in J. Reine Angew. Math. 

P. Delanoe and G. Loeper, Gradient estimates for potentials of invertible gradient 
mappings on the sphere. Calc. Var. Partial Differential Equations, 26 (2006), no. 3, 
297-311. 

A. Fathi and A. Figalli, Optimal transportation on non-compact manifolds. To appear 
in Israel J. Math. 

A. Figalli, Existence, uniqueness, and regularity of optimal transport maps. SIAM J. 
Math. Anal., 39 (2007), no. 1, 126-137. 

[FKM1] A. Figalli, Y.-H. Kim and R. J. McCann, Continuity and injectivity of optimal maps for non-negatively cross-curved costs. Preprint, 2009.

A. Figalli, Y.-H. Kim and R.J. McCann. When is a multidimensional screening a 
convex program? Uniqueness and stability of optimal strategies in the principal-agent 
problem. Preprint at www.math.toronto.edu/mccann. 

A. Figalli and L. Rifford, Continuity of optimal transport maps on small deformations of $S^2$. Comm. Pure Appl. Math., 62 (2009), no. 12, 1670-1706.






[FRV] A. Figalli, L. Rifford and C. Villani, Necessary and sufficient conditions for continuity 
of optimal transport maps on Riemannian manifolds. In preparation. 

[FV] A. Figalli and C. Villani, An approximation lemma about the cut locus, with applications in optimal transport theory. Methods Appl. Anal., 15 (2008), no. 2, 149-154.

[G] C. Gutierrez, The Monge-Ampere Equation. Birkhauser, 2001. 

[J] F. John, Extremum problems with inequalities as subsidiary conditions. Studies and Essays Presented to R. Courant on his 60th Birthday, January 8, 1948, 187-204. Interscience Publishers, Inc., New York, N. Y., 1948.

[K] Y.-H. Kim, Counterexamples to continuity of optimal transportation on positively curved Riemannian manifolds. Int. Math. Res. Not. IMRN 2008, Art. ID rnn120, 15 pp.

[KM1] Y.-H. Kim and R. J. McCann, Continuity, curvature, and the general covariance of 
optimal transportation. Preprint at arXiv:0712.3077. To appear in J. Eur. Math. Soc. 

[KM1a] Y.-H. Kim and R. J. McCann, Appendices to original version of Continuity, curvature, and the general covariance of optimal transportation. Preprint at arXiv:math/0712.3077v1.

[KM2] Y.-H. Kim and R. J. McCann, Towards the smoothness of optimal maps on Riemannian 
submersions and Riemannian products (of round spheres in particular). To appear in 
J. Reine Angew. Math. 

[L] V. L. Levin, Abstract cyclical monotonicity and Monge solutions for the general Monge-Kantorovich problem. Set-valued Anal., 7 (1999), 7-32.

[L1] G. Loeper, On the regularity of solutions of optimal transportation problems. Acta Math., 202 (2009), no. 2, 241-283.

[L2] G. Loeper, Regularity of optimal maps on the sphere: The quadratic cost and the 
reflector antenna. To appear in Arch. Ration. Mech. Anal. 

[LV] G. Loeper and C. Villani, Regularity of optimal transport in curved geometry: the 
nonfocal case. To appear in Duke Math. J. 

[Li] J. Liu, Holder regularity of optimal mappings in optimal transportation. Calc. Var. Partial Differential Equations, 34 (2009), 435-451.

[M] R. J. McCann, Polar factorization of maps on Riemannian manifolds. Geom. Funct. Anal., 11 (2001), 589-608.

[LTW] J. Liu, N. Trudinger and X.-J. Wang, Interior $C^{2,\alpha}$-regularity for potential functions in optimal transportation. To appear in Comm. Partial Differential Equations.

[MTW] X.-N. Ma, N. Trudinger and X.-J. Wang, Regularity of potential functions of the optimal 
transport problem. Arch. Ration. Mech. Anal., 177(2): 151-183, 2005. 

[S] T. Sei, A Jacobian inequality for gradient maps on the sphere and its application to directional statistics. Preprint at arXiv:0906.0874.






[TW] N. Trudinger and X.-J. Wang, On the second boundary value problem for Monge-Ampere type equations and optimal transportation. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 8 (2009), no. 1, 143-174.

[U] J. Urbas, On the second boundary value problem for equations of Monge-Ampere type. J. Reine Angew. Math., 487 (1997), 115-124.

[V] C. Villani, Optimal Transport, Old and New, Grundlehren des mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], Vol. 338, Springer-Verlag, Berlin-New York, 2009.






